Google DeepMind introduced AlphaGenome, a deep-learning model designed to decode non-coding DNA and predict its effects on gene activity and RNA output. Trained on public human and mouse datasets, the model analyses sequences up to one million nucleotides at high resolution. It has been tested by about 3,000 scientists in 160 countries and is available for non-commercial research, though experts caution it remains limited by training data and cannot capture all environmental influences.
Google Unveils AlphaGenome: AI That Reads Up To One Million Letters Of The Human Genome

Google on Wednesday introduced AlphaGenome, a new deep-learning tool from DeepMind designed to decode the functional role of non-coding DNA and accelerate research into genetic disease mechanisms. Researchers say the model could help scientists map regulatory elements in the genome and simulate how genetic variants affect cellular processes — potentially guiding future therapeutic development.
Pushmeet Kohli, Vice President of Research at Google DeepMind and co-author of the study published in Nature, said that while the first complete human genome sequence provided the "text" of life, understanding the genome's "grammar" is the next major challenge. The genome consists of roughly three billion nucleotide pairs represented by the letters A, T, C and G; only about 2% of those pairs code for proteins. The remaining non-coding DNA, once labelled "junk," is now known to regulate when, where and how genes are expressed.
A Million Letters
AlphaGenome was trained on public datasets that measured non-coding DNA activity across hundreds of human and mouse cell and tissue types. The model can analyse very long DNA stretches — up to one million nucleotide letters — and predict, at high resolution, how each position influences cellular processes. Outputs include whether genes are likely to be switched on or off and estimates of RNA production, which transmits genetic instructions inside cells.
Unlike many previous tools that either examine much shorter sequences or produce lower-resolution predictions, AlphaGenome aims to capture the broader regulatory environment around genes while preserving fine-grained detail. That high resolution allows researchers to compare mutated and non-mutated sequences to study the likely molecular effects of genetic variants.
Expert Reactions: Promising but Cautious
Outside scientists praised the advance while urging caution. Ben Lehner of the University of Cambridge, who tested the model, said AlphaGenome "does indeed perform very well" and called precise identification of disease-associated variants an important step toward better therapeutics. He added, however, that the model is "far from perfect" and limited by the quality and scope of available training data.
"AI models are only as good as the data used to train them," Lehner noted.
Robert Goldstone, head of genomics at the Francis Crick Institute, warned that AlphaGenome is "not a magic bullet for all biological questions," since gene expression is also influenced by complex environmental factors the model cannot observe. Still, he described the tool as a breakthrough that will help scientists study and simulate the genetic roots of complex disease.
Availability And Next Steps
Google says AlphaGenome has already been trialled by roughly 3,000 scientists across 160 countries and is available for non-commercial research use. Researchers hope to extend the model with additional data and experimental validation to improve accuracy and broaden its applicability. AlphaGenome joins Google’s broader AI-driven scientific efforts, including AlphaFold, which helped win the 2024 Nobel Prize in Chemistry.
Bottom line: AlphaGenome represents a significant step toward interpreting the vast non-coding regions of the genome, offering high-resolution predictions across very long sequences. It promises to accelerate genomic research, but real-world biological complexity and data limitations mean substantial further work and validation are still required.
Help us improve.

































