erichall/Sabo

SABO (sabo.network) is a corporate accountability and consumer activism platform built to concentrate economic and reputational pressure on corporations that lobby against popular public interest legislation — using capitalism's own mechanisms to hold corporations accountable. The platform has two core components: a community space for coordinated activist action, and an accountability database that synthesizes government documents and trusted datasets (from OpenSecrets, SEC EDGAR, GAO, and others) into accessible intelligence enriched with peer-reviewed analysis from journalists, researchers, and vetted contributors. The MVP is focused on the healthcare/insurance ecosystem — specifically pharmacy benefit management concentration — with four launch campaigns targeting Amazon, Meta, Cigna, and Wells Fargo. http://sabo.network

erichall 6c68aa34b8 Add git_workflow.md		2026-07-24 22:48:37 +00:00
.claude	[Spec Kit] Add project constitution	2026-05-19 09:17:59 -04:00
.specify	Implement specs 014 and 015: share analytics, impact dashboard, and design system	2026-05-24 15:13:17 -04:00
backend	Implement specs 014 and 015: share analytics, impact dashboard, and design system	2026-05-24 15:13:17 -04:00
config	[Spec Kit] Add project constitution	2026-05-19 09:17:59 -04:00
docs	Upload files to "docs"	2026-05-24 10:28:17 +00:00
frontend	Replace all "scoreboard" references with impact dashboard language	2026-05-24 16:06:59 -04:00
mobile	Implement specs 014 and 015: share analytics, impact dashboard, and design system	2026-05-24 15:13:17 -04:00
nginx	[Spec Kit] Add project constitution	2026-05-20 18:29:39 -04:00
scripts	[Spec Kit] Implementation progress	2026-05-18 21:39:46 -04:00
specs	Implement specs 014 and 015: share analytics, impact dashboard, and design system	2026-05-24 15:13:17 -04:00
src	fix: update LDA client to use filing_year + filing_type filters (API has no date range param)	2026-05-18 23:00:33 -04:00
.env.example	[Spec Kit] Implementation progress	2026-05-18 21:39:46 -04:00
.gitignore	[Spec Kit] Implementation progress	2026-05-21 17:35:09 -04:00
CLAUDE.md	Implement specs 014 and 015: share analytics, impact dashboard, and design system	2026-05-24 15:13:17 -04:00
docker-compose.yml	[Spec Kit] Add project constitution	2026-05-20 18:29:39 -04:00
git_workflow.md	Add git_workflow.md	2026-07-24 22:48:37 +00:00
ledger_speckit.md.docx	Add files via upload	2026-05-18 14:22:03 -04:00
README.md	[Spec Kit] Implementation progress	2026-05-18 21:39:46 -04:00
requirements.txt	[Spec Kit] Implementation progress	2026-05-18 21:39:46 -04:00
Sabo Alliance Map.pages	Upload files to "/"	2026-05-24 04:28:02 +00:00
SABO_Context_Document.md.docx	Add files via upload	2026-05-18 14:22:03 -04:00

README.md

SABO

Stop Allowing Big money to Overwhelm — give everyday voters coordinated economic leverage against corporate lobbying.

LDA Data Pipeline

Queries the LDA API weekly for high-spend lobbying filings, matches them against active bills, stores campaigns for curator review, and exposes approved campaigns to downstream systems (browser extension, dashboard).

Prerequisites

Python 3.11+
A free LDA API key from lda.senate.gov (register for an account)
macOS or Linux

Setup

# 1. Install dependencies
pip install -r requirements.txt

# 2. Configure environment
cp .env.example .env
# Edit .env and set your LDA_API_KEY

Configure Monitored Bills

Edit config/bills.yml to add the bills you want to track:

bills:
  - bill_id: "S-4006"
    title: "Healthcare Price Transparency Act"
    keywords:
      - "health care"
      - "price transparency"
      - "hospital costs"
    status: "active"

Only bills with status: "active" are used by the pipeline. Keyword matching is case-insensitive substring search against the filing's issue descriptions.
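The matching rule described above can be sketched as follows. This is a minimal illustration, not the pipeline's actual code: the `Bill` class and `match_bills` function are hypothetical names, and the second bill is made up to show the status filter.

```python
from dataclasses import dataclass, field


@dataclass
class Bill:
    # Hypothetical stand-in for one entry in config/bills.yml.
    bill_id: str
    title: str
    keywords: list[str] = field(default_factory=list)
    status: str = "active"


def match_bills(issue_description: str, bills: list[Bill]) -> list[str]:
    """Return IDs of active bills with at least one keyword found in the text."""
    text = issue_description.casefold()
    matched = []
    for bill in bills:
        if bill.status != "active":
            continue  # only bills marked "active" take part in matching
        if any(kw.casefold() in text for kw in bill.keywords):
            matched.append(bill.bill_id)
    return matched


bills = [
    Bill("S-4006", "Healthcare Price Transparency Act",
         ["health care", "price transparency", "hospital costs"]),
    # Made-up entry, present only to show that inactive bills are skipped.
    Bill("X-0001", "Hypothetical inactive bill", ["billing"], status="inactive"),
]

print(match_bills("Issues related to Hospital Costs and billing", bills))
# prints ['S-4006']
```

Because this is plain substring search, the keyword "health care" would not match a description that says "healthcare"; listing both spellings in `keywords` is one way to cover that.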

Run the Pipeline

# Dry run (no database writes — good for testing API connectivity)
python -m src.pipeline.runner --dry-run

# Live run (fetches last 8 days of filings)
python -m src.pipeline.runner

# Fetch filings from a custom start date
python -m src.pipeline.runner --since 2026-01-01

# Verbose output (logs each filing as it is processed)
python -m src.pipeline.runner --verbose

Exit codes: 0 = success, 1 = API error, 2 = config error, 3 = database error

Review Campaigns

# List campaigns pending review
python -m src.review.cli list

# List by status
python -m src.review.cli list --status approved
python -m src.review.cli list --status rejected

# Approve a campaign
python -m src.review.cli approve 1

# Reject a campaign (with optional reason)
python -m src.review.cli reject 2 --reason "Spend data appears incorrect"

# View full history with filters
python -m src.review.cli history
python -m src.review.cli history --status approved
python -m src.review.cli history --company "amazon"
python -m src.review.cli history --since 2026-01-01
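The history filters above might combine along these lines. This is a sketch under assumptions: the `Campaign` fields shown are hypothetical (the real dataclass is in src/models/models.py), "pending" is an assumed status name, company matching is assumed to be a case-insensitive substring test, and the sample rows are invented.

```python
from dataclasses import dataclass
from datetime import date


@dataclass
class Campaign:
    # Hypothetical fields for illustration only.
    id: int
    company: str
    status: str  # assumed values: "pending", "approved", "rejected"
    created: date


def filter_history(campaigns, status=None, company=None, since=None):
    """Keep campaigns that satisfy every filter that was supplied."""
    result = []
    for c in campaigns:
        if status is not None and c.status != status:
            continue
        if company is not None and company.casefold() not in c.company.casefold():
            continue
        if since is not None and c.created < since:
            continue
        result.append(c)
    return result


# Invented sample data.
campaigns = [
    Campaign(1, "Example Health Inc", "approved", date(2026, 2, 1)),
    Campaign(2, "Example Health Inc", "rejected", date(2025, 12, 15)),
    Campaign(3, "Sample Bank Corp", "approved", date(2025, 11, 3)),
]

print([c.id for c in filter_history(campaigns, status="approved")])
# prints [1, 3]
print([c.id for c in filter_history(campaigns, company="example", since=date(2026, 1, 1))])
# prints [1]
```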

Configure Cron (Weekly Schedule)

# Make the script executable (already done if you cloned the repo)
chmod +x scripts/run_pipeline.sh

# Edit your crontab
crontab -e

# Add this line to run every Sunday at midnight:
0 0 * * 0 /path/to/sabo/scripts/run_pipeline.sh >> /path/to/sabo/data/pipeline.log 2>&1

Validation

See specs/001-lda-data-pipeline/quickstart.md for a step-by-step end-to-end validation guide covering all four scenarios.

Project Structure

src/
  models/models.py        — LDAFiling, MonitoredBill, Campaign dataclasses
  db/database.py          — SQLite connection + schema
  db/campaigns.py         — Campaign CRUD + downstream query interface
  db/bills.py             — MonitoredBill read operations
  pipeline/runner.py      — Pipeline entry point
  pipeline/lda_client.py  — LDA API HTTP client
  review/cli.py           — Curator review CLI
  config/loader.py        — YAML bills config loader
config/bills.yml          — Monitored bills (curator-maintained)
scripts/run_pipeline.sh   — Cron wrapper
data/sabo.db              — SQLite database (gitignored)