Pages

Wednesday, October 15, 2025

Can ChatGPT Analyze Your Raw Data Zip file? Yes it can!

 When I was analyzing my new AncestryDNA ethnicity results I wondered if no calls could have affected my ethnicity results? Could the missing ethnicity be due to no calls? I asked ChatGPT that question. I was surprised when I got the answer that I could upload my raw data and check for no calls. I found out I only have 1.64% no calls. According to Chatgpt this is a low percentage of no calls and they are spread throughout my genome. This shouldn't affect my ethnicity results or matches. 

No calls look like this in your DNA raw data:


Zero Zero, instead of a letters, means they could not determine your genotype. It sounds like generally no calls are low and not a problem.

I decided to ask ChatGPT to analyze my DNA for more ancestry and health related markers. I've checked the markers at 23andMe to make sure they were reported by ChatGPT correctly. ChatGPT seems highly accurate, but you should confirm if a marker is of particular importance to you. I've also uploaded to Gemini AI, but found they didn't always report the genotype correctly (meaning AG, TT, etc.). 


 It can only give very limited information regarding ethnicity. ChatGPT relies on only a few markers for ethnicity related characteristics, such as skin color and hair texture etc.. It doesn't have a reference panel. It would need a reference panel to provide more ancestral information. 

Health information was more detailed. It searches the web for information about your health related markers. The information it provided about my Thyroid disease could prove very helpful. According to one genetic marker I might have a hard time processing the thyroid replacement hormone and may need to change my treatment. Since I have continuing symptoms this might help? 

Ancestral information from ChatGPT:

What the markers show (straight facts from your file)

These are the genotypes I found in your file for the 11 AIMs we checked:

  • rs1426654 (SLC24A5) — AA — strongly European (light-skin associated)

  • rs16891982 (SLC45A2) — GG — European-associated allele

  • rs12913832 (HERC2/OCA2) — AA — European (eye-color) allele

  • rs3827760 (EDAR) — AGheterozygous for the East-Asian/Native-American-associated variant

  • rs260690 — AC

  • rs4891825 — AA

  • rs4798812 — AG

  • rs2065160 — AA

  • rs4988235 (LCT) — AG — lactase persistence allele present (typical in Europeans)

  • rs12203592 (IRF4) — CC

  • rs2814778 (DARC / Duffy) — TT — not the African Duffy-null allele

Short interpretation of those genotypes:

  • Strong overall European signal — multiple canonical European alleles are homozygous or present (SLC24A5, SLC45A2, HERC2, LCT). That fits a French / Northwestern European ancestry profile.

  • One heterozygous EDAR (rs3827760 AG) — this is the interesting bit. The derived EDAR allele is common in East Asian and Native American populations and rare in most Europeans. A single copy (AG) suggests a small contribution somewhere in your ancestry that carries that variant — consistent with limited Native American or East-Asian-related ancestry, which is plausible in some French-Canadian pedigrees (due to historical admixture) or could come from other branches.
    — Important: one heterozygous EDAR does not imply large East-Asian ancestry; it’s just an informative signal that merits follow-up.

  • No signals of significant recent African ancestry in the small marker set (rs2814778 not showing the African-null allele).

I definitely have Native American ancestry, so that information is correct. 

What ChatGPT Can Tell You About Your Raw DNA Data (information from ChatGPT on what it can do)

Beyond ancestry reports and trait predictions offered by these companies, AI tools like ChatGPT can help you interpret your data in fascinating ways.

1. Understanding Your Genetic Variants
Raw DNA data is essentially a long list of genetic variants, called SNPs (single nucleotide polymorphisms). Each SNP represents a small change in your DNA that can affect traits, ancestry, or disease risk. ChatGPT can help you understand what specific variants mean by explaining:

  • How a particular SNP has been associated with health traits in scientific studies

  • How common the variant is in different populations

  • Whether it might contribute to certain traits like eye color, lactose tolerance, or even caffeine metabolism

2. Health Insights (With Caution!)
While ChatGPT can summarize research about genetic variants, it cannot provide medical advice or diagnoses. However, it can:

  • Identify genetic markers associated with certain conditions, such as thyroid disorders, cardiovascular traits, or metabolic differences

  • Explain the current state of research on a gene or variant

  • Suggest areas you might discuss with a healthcare professional

It’s crucial to remember that genetics is only part of the story. Lifestyle, environment, and other factors often play a larger role than a single genetic variant.

3. Ancestry and Ethnicity Clues
Even though raw DNA files don’t come with polished ancestry reports, ChatGPT can help interpret markers linked to specific populations or regions. For example:

  • Identify SNPs more common in Native American, European, or African populations

  • Suggest possible regional connections based on genetic patterns

  • Help you explore historical migrations reflected in your DNA

4. Traits and Personal Characteristics
ChatGPT can summarize associations between your DNA and non-health traits, such as:

  • Hair color, eye color, and other physical features

  • Taste preferences or sensitivities (like bitter taste perception or lactose intolerance)

  • Sleep patterns or exercise response

5. Research and Curiosity
Your raw DNA is a powerful tool for personal exploration. ChatGPT can guide you in:

  • Researching genes or variants you find interesting

  • Comparing your genetic profile to published studies

  • Exploring ancestry in a broader historical and geographical context

6. Limitations
It’s important to approach AI interpretations carefully:

  • Not all genetic associations are strong or clinically relevant

  • ChatGPT relies on publicly available research and cannot access private medical records

  • Always consult a healthcare professional for medical concerns

Conclusion

ChatGPT can turn your raw DNA data into a source of insight and curiosity. From ancestry clues to understanding your genetic markers, AI provides a way to explore your DNA beyond what consumer testing companies offer. While it doesn’t replace professional medical advice, it opens doors to learning more about yourself, your heritage, and the science of genetics.

No comments:

Post a Comment