MetaClean

MetaClean is a web app (with optional CLI) that validates and standardizes sequencing sample metadata before it enters your pipeline. It ingests sample sheets (Illumina, MGI, custom CSV/TSV), LIMS exports, and minimal clinical metadata, then runs rule-based checks plus lightweight AI-assisted mapping of key fields (organism, tissue, library prep, read layout, barcodes, consent flags) to your lab’s controlled vocabulary. It flags common failure points: duplicate sample IDs, barcode collisions, inconsistent lane assignments, forbidden characters, missing required fields, and mismatched reference builds. It outputs a clean, versioned sample sheet plus a human-readable “what changed and why” report for audit trails. Brutal truth: this is not glamorous science, it is operational hygiene, but it saves real money by preventing failed runs, rework, and downstream analysis confusion.
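To make the rule-based checks concrete, here is a minimal sketch of four of them (missing required fields, forbidden characters, duplicate sample IDs, and barcode collisions within a lane). It is not MetaClean's actual implementation: the column names (`sample_id`, `lane`, `index`, `organism`), the allowed character set, and the function name `check_sample_sheet` are all assumptions chosen for illustration.

```python
import csv
import io
import re
from collections import defaultdict

# Hypothetical column names and rules; a real lab would configure these.
REQUIRED = ("sample_id", "lane", "index", "organism")
FORBIDDEN = re.compile(r"[^A-Za-z0-9_-]")  # assumed allowed charset for IDs


def check_sample_sheet(text):
    """Return a list of (row_number, message) findings for a CSV sample sheet."""
    findings = []
    seen_ids = {}                      # sample_id -> row where first seen
    seen_barcodes = defaultdict(dict)  # lane -> {index: row where first seen}

    reader = csv.DictReader(io.StringIO(text))
    for row_num, row in enumerate(reader, start=2):  # row 1 is the header
        # Missing required fields (absent or blank).
        for field in REQUIRED:
            if not (row.get(field) or "").strip():
                findings.append((row_num, f"missing required field: {field}"))

        sample_id = (row.get("sample_id") or "").strip()
        lane = (row.get("lane") or "").strip()
        index = (row.get("index") or "").strip().upper()

        if sample_id:
            # Forbidden characters in the sample ID.
            if FORBIDDEN.search(sample_id):
                findings.append(
                    (row_num, f"forbidden character in sample_id: {sample_id!r}")
                )
            # Duplicate sample IDs anywhere in the sheet.
            if sample_id in seen_ids:
                findings.append(
                    (row_num, f"duplicate sample_id {sample_id!r} "
                              f"(first seen on row {seen_ids[sample_id]})")
                )
            else:
                seen_ids[sample_id] = row_num

        # Barcode collisions: the same index used twice in one lane.
        if lane and index:
            if index in seen_barcodes[lane]:
                findings.append(
                    (row_num, f"barcode collision in lane {lane}: {index} "
                              f"(first seen on row {seen_barcodes[lane][index]})")
                )
            else:
                seen_barcodes[lane][index] = row_num
    return findings


SHEET = """sample_id,lane,index,organism
S1,1,ACGTACGT,human
S2,1,ACGTACGT,human
S1,2,TTGCATGC,mouse
S 4,2,GGCCAATT,
"""

for row_num, message in check_sample_sheet(SHEET):
    print(f"row {row_num}: {message}")
```

Each finding carries the row number and a plain-language reason, which is the kind of record a “what changed and why” report could be built from. Checks such as reference-build mismatches or vocabulary mapping would need lab-specific lookup tables and are left out of this sketch.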
