' ; ?>
화요일, 3월 10, 2026
HomeMedical NewsNew AI model predicts gene expression across human cell types

New AI model predicts gene expression across human cell types



Summary: A crew of investigators from Dana-Farber Cancer Institute, The Broad Institute of MIT and Harvard, Google, and Columbia University have created a man-made intelligence model that may predict which genes are expressed in any kind of human cell. The model, known as EpiBERT, was impressed by BERT, a deep studying model designed to know and generate human-like language.

EpiBERT was skilled on information from a whole bunch of human cell types in a number of phases. It was fed the genomic sequence, which is 3 billion base pairs lengthy, together with maps of chromatin accessibility that inform which of those sequences are unwound from the chromosome and browse by the cell. The model was first skilled to be taught the connection between DNA sequence and chromatin accessibility across massive chunks of the genome in a particular cell kind. It then makes use of these realized relationships to foretell which genes had been lively within the corresponding cell kind. It precisely recognized regulatory parts – components of the genome acknowledged by transcription elements – and their affect on gene expression across many cell types, constructing a “grammar” that’s generalizable and predictable. This grammar-building course of might be likened to the way in which a big language model, comparable to ChatGPT, learns to construct significant sentences and paragraphs from many examples of textual content. The EpiBERT model can course of accessibility and predict purposeful bases in addition to RNA expression for a never-before-seen cell kind. 

Significance: Every cell within the physique has the identical genome sequence, so the distinction between two types of cells just isn’t the genes within the genome, however which genes are turned on, when, and the way a lot. Approximately 20% of the genome codes for regulatory parts decide which genes are turned on, however little or no is understood about the place these codes are within the genome, what their directions seem like, or how mutations have an effect on operate in a cell. EpiBERT will make clear how genes are regulated in cells and, doubtlessly, how that cell’s regulatory system might be mutated in ways in which result in ailments comparable to most cancers.

Funding: The Broad Institute, the Novo Nordisk Foundation, the National Genome Research Institute, the Sharf Green Cancer Research Fund, the Richard and Nancy Lubin Family, and the American Cancer Society. Tensor Processing Unit (TPU) entry and assist offered by Google.

Source:

Journal reference:

Javed, N., et al. (2025). A multi-modal transformer for cell type-agnostic regulatory predictions. Cell Genomics. doi.org/10.1016/j.xgen.2025.100762.

RELATED ARTICLES
RELATED ARTICLES

Most Popular