ADAM

Genomics processing engine and file format using Apache Avro, Spark, and Parquet.

Established · Open Source · Low lock-in

Pricing: See website (flat rate)

Adoption: Stable

License: Open Source (Apache License 2.0)

Overview

What is ADAM?

ADAM is an open-source genomics processing library and command-line toolkit built on Apache Spark. It defines genomic record schemas in Apache Avro and stores data in the columnar Apache Parquet format, so queries can read only the fields they need and processing can be distributed across a Spark cluster.
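To illustrate why a columnar format such as Parquet suits genomic queries, here is a minimal sketch in plain Python. It does not use ADAM, Spark, or Parquet; the record fields, the sample values, and the `to_columns` helper are hypothetical and exist only for this example.

```python
from collections import defaultdict

# Toy alignment records (hypothetical values), one dict per read: a row-oriented layout.
rows = [
    {"read_name": "r1", "contig": "chr1", "start": 100, "mapq": 60},
    {"read_name": "r2", "contig": "chr1", "start": 250, "mapq": 12},
    {"read_name": "r3", "contig": "chr2", "start": 40, "mapq": 37},
]


def to_columns(records):
    """Pivot row-oriented records into one list per field (a column-oriented layout)."""
    columns = defaultdict(list)
    for record in records:
        for field, value in record.items():
            columns[field].append(value)
    return dict(columns)


columns = to_columns(rows)

# A query that needs a single field touches one column instead of every full record.
mean_mapq = sum(columns["mapq"]) / len(columns["mapq"])
print(round(mean_mapq, 2))
```

In a real columnar file the same idea applies on disk: a query over mapping quality can skip the bytes that hold sequences and base qualities, which usually dominate the size of alignment data.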

Key differentiator

ADAM combines distributed processing on Apache Spark with built-in support for standard genomics file formats, so existing datasets can be loaded into Spark pipelines and stored as Parquet files with Avro-defined schemas.

Capability profile

Strength Radar

[Radar chart. Axes: Efficient storag… · Built on top of … · Supports a wide …]

Honest assessment

Strengths & Weaknesses

↑ Strengths

Efficient storage and querying of genomic data using Apache Avro and Parquet formats.

Built on top of Apache Spark for scalable processing.

Reads and writes common genomics file formats, including SAM/BAM/CRAM, VCF, BED, GFF3/GTF, FASTA, and FASTQ.
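To make the file-format point concrete, the following is a minimal sketch of mapping one SAM alignment line onto a typed, schema-style record. It is plain Python and not ADAM's API: the `Alignment` class and `parse_sam_line` function are hypothetical, they keep only a subset of the 11 mandatory SAM fields, and the conversion to a 0-based start is a choice made for this example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Alignment:
    """Hypothetical record type holding a subset of the mandatory SAM fields."""
    read_name: str
    flag: int
    contig: str
    start: int  # 0-based start, converted from SAM's 1-based POS
    mapq: int
    cigar: str
    sequence: str

    @property
    def is_mapped(self) -> bool:
        # SAM FLAG bit 0x4 marks a read as unmapped.
        return not (self.flag & 0x4)


def parse_sam_line(line: str) -> Alignment:
    """Parse one tab-separated SAM alignment line (not a header line starting with '@')."""
    fields = line.rstrip("\n").split("\t")
    if len(fields) < 11:
        raise ValueError(f"expected at least 11 SAM fields, got {len(fields)}")
    qname, flag, rname, pos, mapq, cigar, _rnext, _pnext, _tlen, seq, _qual = fields[:11]
    return Alignment(
        read_name=qname,
        flag=int(flag),
        contig=rname,
        start=int(pos) - 1,
        mapq=int(mapq),
        cigar=cigar,
        sequence=seq,
    )


example = "r1\t0\tchr1\t101\t60\t4M\t*\t0\t0\tACGT\tIIII"
record = parse_sam_line(example)
print(record.contig, record.start, record.is_mapped)
```

Once records share a fixed set of typed fields like this, they can be stored column by column and queried by field, which is the general idea behind pairing a schema language with a columnar file format.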

Fit analysis

Who is it for?

✓ Best for

Research teams working with large-scale genomics datasets who need efficient storage and processing capabilities.

Organizations integrating genomics data into their existing big data pipelines built on Apache Spark.

✕ Not a fit for

Projects requiring real-time streaming of genomic data (ADAM is optimized for batch processing).

Teams that want a fully managed, cloud-hosted service and do not want to operate their own Spark infrastructure.

Cost structure

Pricing

Free Tier: None

Starts at: See website

Model: Flat rate

Enterprise: None
