SPASpatial Ancestry analysis (SPA) is a method for predicting ancestry or where an individual is from using the individual's DNA. Accurately modeling ancestry is an important step in identifying genetic variation involved in disease. SPA was created by Wen-Yun Yang, John Novembre, Eleazar Eskin and Eran Halperin For general questions, comments or suggestions about SPA, please email Eleazar Eskin (eeskin (AT) cs.ucla.edu). ManualThe full manual is available here Online user's guide
usage: spa [options] --bfile <FILE> prefix of .bed, .bim and .fam file --pfile <FILE> prefix of .ped and .map file --tfile <FILE> prefix of .tped and .tfam file --gfile <FILE> genotype file --mfile <FILE> 23andMe genotype file --location-input <FILE> known locations --model-input <FILE> known slope functions --location-output <FILE> output file for individual locations --model-output <FILE> output file for slope function coefficients and SPA score -n number of locations (default 1) -k dimensions of spatial analysis (default 2) -e set tolerance of termination criterion (default 0.01) -r set optimization epsilon tolerance (default 1e-6). Larger value makes the program run faster but poor accuracy -v verbose level (default 1)
<---- myfile.ped ----> <--- myfile.map ---> 1 1 0 0 1 1 A A G T 1 snp1 0 5000650 2 1 0 0 1 1 A C T G 1 snp2 0 5000830 3 1 0 0 1 1 C C G G 4 1 0 0 1 2 A C T T 5 1 0 0 1 2 C C G T 6 1 0 0 1 2 C C T T <------------- myfile.tped -------------> <- myfile.tfam -> 1 snp1 0 5000650 A A A C C C A C C C C C 1 1 0 0 1 1 1 snp2 0 5000830 G T G T G G T T G T T T 2 1 0 0 1 1 3 1 0 0 1 1 4 1 0 0 1 2 5 1 0 0 1 2 6 1 0 0 1 2 <- myfile.geno -> 2 1 1 1 0 2 1 0 0 1 0 0
<-------- myfile.loc ---------> 1 1 0 0 1 0 0.23424 -0.43045 2 1 1 3 1 0 0.92378 -0.23442 3 1 0 0 0 1 1.20334 0.23234 4 1 0 0 1 0 -0.23435 -0.95965 5 1 0 0 0 1 -0.23675 0.43485 6 1 4 5 0 0 -1.03294 0.30438 <--------------------- myfile.model ----------------------> 1 snp1 0 5000650 C A 0.93454 0.21351 0.02342 1.09772 1 snp2 0 5000830 T G 0.23432 0.34849 0.95853 0.92843 |