Metabarcoding outgrew custom pipelines. spcfy is what comes next.
Shared infrastructure for labs, bioinformaticians, and researchers who need DNA Metabarcoding data to be reproducible, comparable, and decision-ready. One standard, every project, every lab. Not in months. In hours.
Not ready for a demo? Read the workflow documentation →
Covering CO1, 16S, ITS2, 12S, PITS2. Animals, plants, fungi, bacteria.
Three roles in the same data flow. Pick the one that fits today.
spcfy is shared infrastructure between labs, the people who generate meaning from the data, and the customers who act on it. Jump to the part that matches your work.
DNA Metabarcoding has outgrown spreadsheets.
A single project generates thousands of OTUs across hundreds of samples, each with ecological metadata. Species lists do not carry that complexity. Lab-specific pipeline stacks do not scale with it.
You need infrastructure built for this volume.
Without it, every new project starts from scratch. Every comparison needs a caveat. Every dataset ages the moment the next reference database version drops.
Raw sequence output has no memory. spcfy gives it one.
Every OTU gets a stable identity. Every result carries its pipeline version, its reference database, its QC threshold. Your 2024 data is still comparable in 2026.
Results that hold up. Across labs, databases, and time.
Every order goes through a multi-database consensus pipeline, built by bioinformaticians who run their own labs.
SPARK · Consensus Taxonomy
SPARK evaluates all BLAST hits and selects the most reliable match using a weighted score that combines identity, taxonomic resolution, and consistency across hits. Do not rely on a single top hit — identify the overall best. Robust. Precise. Built for reproducibility.
LCA · Taxonomic Annotation
spcfy applies a rich set of taxonomic annotations — worldwide or regional BOLD, NCBI GenBank, SILVA, UNITE, RDP — tailored to different amplicons. It then condenses them into a Least Common Ancestor taxonomic consensus. Built for easy interpretability.
SPIN · Stable OTU Identity
SPIN assigns each sequence a stable spcfy identification number, analogous to a BIN but designed for metabarcoding. OTU sequences are matched against a reference SPIN database, allowing sequences from different runs to be merged instantly without re-clustering. Consistent tracking. Direct comparability of taxa across samples, projects, and time.
Speak to your data
Ask questions across your biodiversity datasets in plain language. Surface patterns, flag anomalies, generate reports. Powered by spcfAI, built on the same standardized data layer.
Part of spcfy PREDICT and spcfAI.Your data stays yours. Full provenance, full control.
spcfy is built on data ownership as a first principle. Every dataset, raw and processed, is available for download at any time, in standard formats. No lock-in.
You own your data
All data is hosted in Germany, GDPR-compliant, with clear access controls. You decide who sees what, when, and in what format.
Every project carries a locked audit trail: pipeline version, reference database version, QC thresholds. Your results stay reproducible and citable.
Raw and processed datasets are available for download in standard formats at any time. Full data ownership is given from day one.
FAIR-aligned publishing, when you want it
In developmentWhen publication is the goal, spcfy is designed to make it straightforward. Standardized exports, Darwin Core compatibility, DOI-ready metadata, and full parameter provenance will travel with your dataset.
Direct connections to public repositories are currently in development. Once available, all exports will be structured for direct submission to:
Labs deliver. Customers explore. Everything is comparable.
spcfy is not a tool for one lab. It is shared infrastructure where every lab's output follows the same standard, so customers can actually compare results across labs and over time.
From sequencing run to client-ready delivery, without custom scripts or manual QC.
Four tiers. One ecosystem.
Start with the data foundation. Add analytical depth as your needs grow.
The data foundation. From raw FASTQ to standardized, comparable output.
The analytical layer. Explore, compare, and share biodiversity intelligence.
Predictive layers and Nature KPIs. Where understanding becomes foresight.
Natural language interface for your biodiversity data. Ask. Explore. Act.
Come talk DNA Metabarcoding with us.
We are at BioMonWeek in Montpellier with the people who built spcfy. Find us on site for a direct conversation about your workflow, your data, and whether spcfy fits.
Find us by our cards: three colors, three roles, one infrastructure.
We can still connect.
No time to talk on site, or we missed each other entirely? Book an online demo and we will walk you through the platform at a time that works for you. New accounts also get 4 weeks of freemium access with 15 processing credits, no commitment required.
From field sample to biodiversity signal.
In hours. On infrastructure built for this volume and complexity.
Questions first? Write to us at hello@spcfy.io