Skip to content

Installation

Requirements

  • Python 3.11 or higher
  • pip, uv, or another Python package manager

Basic Installation

pip install databrew
uv add databrew

This installs the core package with HTTP-based fetching (httpx), HTML/JSON extraction, and Parquet storage (via DuckDB).

Optional Dependencies

Browser Support

For JavaScript-heavy websites that require browser rendering:

pip install databrew[browser]
uv add databrew --extra browser

This installs pydoll, a Python browser automation library that uses Chrome DevTools Protocol.

Development Installation

To install from source for development:

git clone https://github.com/datakomari/databrew.git
cd databrew

# Using uv (recommended)
uv sync --all-extras

# Using pip
pip install -e ".[browser]"

Verifying Installation

Check that databrew is installed correctly:

databrew --help

You should see the list of available commands.

Next Steps