Genomics Is Not NLP: A Field Guide for ML Scientists
A field guide for ML scientists moving into genomics and transcriptomics: why DNA only looks like text, why the whole species is one near-duplicate corpus, how regulation defeats the context window, the biology you cannot skip, why the molecule you sequence is not the one that acts, and what the famous foundation models do and don’t solve.