Authors: M. Gymrek, A. L. McGuire, D. Golan, E. Halperin, Y. Erlich
Keywords:
Abstract: Sharing sequencing data sets without identifiers has become a common practice in genomics. Here, we report that surnames can be recovered from personal genomes by profiling short tandem repeats on the Y chromosome (Y-STRs) and querying recreational genetic genealogy databases. We show that a combination of a surname with other types of metadata, such as age and state, can be used to triangulate the identity of the target. A key feature of this technique is that it entirely relies on free, publicly accessible Internet resources. We quantitatively analyze the probability of identification for U.S. males. We further demonstrate the feasibility of this technique by tracing back with high probability the identities of multiple participants in public sequencing projects.