Somatic Mosaicism across Human Tissues Data Portal
A platform to search, visualize, and download somatic mosaic variants in normal tissues.
Latest Official Release: October 11, 2025
Benchmarking - with all technologies
Cell Lines
Assays
Mutations
Files Generated
Production - with core + additional technologies
CELL LINES & TISSUES
Ectoderm Tissues
Brain: unrelated donors
Sun-exposed skin
Endoderm Tissues
Liver
Lung
Colon
Cell Line Mixtures
COLO829 Mixture
HapMap Mixture
iPSC & Fibroblast
AVAILABLE ASSAYS
Bulk WGS short read
Bulk WGS long read
Bulk RNA-seq
Single-cell WGS
Single-cell RNA-seq
Single-molecule/duplex WGS
Epigenome profiling
New Data Releases
Announcements
New Access Regulation (Mar 30, 2026)
Due to new data access regulations, all SMaHT Data Portal users MUST use an institutional email address to log in to the Data Portal.
Gmail, Yahoo, Hotmail, and other free email accounts will be prohibited from accessing the Data Portal effective April 1, 2026.
Data Retraction (Dec 11, 2025)
Illumina bulk WGS data were retracted for the following production donor samples due to data duplication: SMHT001-3A, SMHT005-3AF, SMHT007-3A, and SMHT022-3A.
Attention: BAM change (Jul 23, 2025)
As of July 8, 2025, DAC will release BAM files without the BI and BD tags, which were originally added after base quality score recalibration (BQSR).
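For downstream users who want to confirm whether a given alignment record still carries these tags, the check can be sketched roughly as follows. This is a minimal illustration rather than a portal tool: it assumes records are available as tab-delimited SAM text lines (for example, as printed by `samtools view`), and the function names `optional_tags` and `has_bqsr_indel_tags` are hypothetical.

```python
def optional_tags(sam_line: str) -> set[str]:
    """Return the two-character names of the optional fields in a SAM alignment line."""
    fields = sam_line.rstrip("\n").split("\t")
    # A SAM alignment line has 11 mandatory columns; optional fields
    # follow in TAG:TYPE:VALUE form.
    return {field.split(":", 1)[0] for field in fields[11:]}


def has_bqsr_indel_tags(sam_line: str) -> bool:
    """True if the record carries a BI or BD tag (hypothetical helper)."""
    return bool(optional_tags(sam_line) & {"BI", "BD"})
```

A record from a BAM released after the change would be expected to make `has_bqsr_indel_tags` return `False`, while older files may still return `True`.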
Data Retraction (Mar 10, 2025)
One WGS ONT PromethION 24 BAM from COLO829-BLT50, SMAFIPHR8QOG, has been retracted due to a sample swap.
New Features (Jan 25, 2025)
Explore the Interactive QC Assessment page for data on the portal.
Attention Users
The V1 Benchmarking data portal is currently open to SMaHT consortium members only.
Data-related news
- The raw sequence files (i.e., unaligned BAM and FASTQ) and the data from the benchmarking tissue samples that were not distributed by TPC are currently available upon request through Globus.
- The SMaHT Data Portal, V1 Benchmarking release, now makes benchmarking data available for download for authenticated consortium members. Users can continue to obtain the access keys for metadata submission.

