skrub

Python library for preprocessing and feature engineering on dataframes.

Established · Open Source · Low lock-in

Pricing

Free

No paid plans

Adoption

Stable

License

Open Source (BSD 3-Clause)

Overview

What is skrub?

Skrub is a Python library for preprocessing and feature engineering on dataframes, aimed at preparing tabular data for machine learning. It provides scikit-learn-compatible transformers that work on pandas and polars dataframes and turn raw, mixed-type columns into features a model can use.

Key differentiator

Skrub's main differentiator is that it works on whole dataframes rather than column by column: it picks a sensible default treatment for each column type, so developers spend less time on data preparation and more on model development.

Honest assessment

Strengths & Weaknesses

↑ Strengths

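Handles common preprocessing and feature engineering steps with scikit-learn-compatible transformers.

Works directly on pandas and polars dataframes, turning mixed-type columns into numeric features for machine learning.

Offers a variety of methods to clean and transform data, including encoders for messy categorical and datetime columns and fuzzy joins across tables.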

Fit analysis

Who is it for?

✓ Best for

Developers working with Python who need to preprocess large datasets efficiently.

Teams that require robust feature engineering capabilities without complex setup.

✕ Not a fit for

Projects requiring real-time data processing and transformation.

Users looking for a web-based UI for data preprocessing tasks.

Cost structure

Pricing

Free Tier

Yes (the full library is free)

Starts at

$0

Model

Open source, no paid plans

Enterprise

None
