Installation

Requirements

  • Python 3.11 or higher
  • pip, uv, or another Python package manager

Basic Installation

# Using pip
pip install databrew

# Using uv
uv add databrew

This installs the core package with HTTP-based fetching (httpx) and HTML/JSON extraction.

Optional Dependencies

Databrew has optional extras for additional functionality:

Browser Support

For JavaScript-heavy websites that require browser rendering:

# Using pip (quotes keep shells such as zsh from expanding the brackets)
pip install "databrew[browser]"

# Using uv
uv add databrew --extra browser

This installs pydoll, a Python browser automation library that uses the Chrome DevTools Protocol.

Analytics Support

For faster exports with large datasets (7-10x speedup):

# Using pip (quotes keep shells such as zsh from expanding the brackets)
pip install "databrew[analytics]"

# Using uv
uv add databrew --extra analytics

This installs DuckDB, which enables fast JSONL, JSON, and Parquet exports by reading SQLite directly.

All Extras

Install everything:

# Using pip (quotes keep shells such as zsh from expanding the brackets)
pip install "databrew[browser,analytics]"

# Using uv
uv add databrew --extra browser --extra analytics

Development Installation

To install from source for development:

git clone https://github.com/datakomari/databrew.git
cd databrew

# Using uv (recommended)
uv sync --all-extras

# Using pip
pip install -e ".[browser,analytics]"

Verifying Installation

Check that databrew is installed correctly:

databrew --help

You should see the list of available commands.
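
To see which optional extras are present in the current environment, a small script along these lines may be useful. It is a sketch under assumptions: the import names `databrew`, `httpx`, `pydoll`, and `duckdb` are guessed from the package names mentioned above, and the helpers `installed` and `report` are illustrative, not part of databrew.

```python
from importlib.util import find_spec

# Import names are assumptions based on the packages named in this guide.
MODULES = {
    "databrew": "core package",
    "httpx": "HTTP fetching (core dependency)",
    "pydoll": "browser extra",
    "duckdb": "analytics extra",
}


def installed(module_name: str) -> bool:
    """Return True if a top-level module can be located without importing it."""
    return find_spec(module_name) is not None


def report(modules=MODULES) -> dict[str, bool]:
    """Map each module name to whether it is available in this environment."""
    return {name: installed(name) for name in modules}


if __name__ == "__main__":
    for name, found in report().items():
        marker = "found" if found else "missing"
        print(f"{name:10} {marker:8} {MODULES[name]}")
```

A missing `pydoll` or `duckdb` only means the corresponding extra was not installed; the core package works without them.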

Next Steps