764 Texas Public Data Sources. One Docker Container. $13.37.

Announcing the Texas Scraper Kit — the first tool from c4573.org.

The Problem

Texas has some of the richest public data in the country. Business filings, property records, oil well permits, campaign finance disclosures, professional licenses, environmental violations, unclaimed property — hundreds of government agencies publishing data the public has a legal right to access.

But "public" doesn't mean "accessible."

Each agency has its own website, its own search interface, its own quirks. Some use ColdFusion apps from 2004. Some require custom SSL cipher suites. Some run on Blazor and SignalR. Some return Jasper Reports that take 25 seconds to generate.

If you're a journalist, researcher, real estate investor, or data analyst who needs this data at scale, your options are:

  1. Click through government websites manually. Death by a thousand tabs.
  2. Pay a data vendor $200–$500/month. They scraped the same public data and marked it up 100x.
  3. Build your own scrapers. Weeks of reverse-engineering per agency, plus ongoing maintenance when they change their HTML.

We chose option 4: build it once, sell it cheap, and let everyone run it themselves.

What You Get

The Texas Scraper Kit is a Docker container with a REST API and CLI. You deploy it on your machine — laptop, server, VPS, whatever — and it scrapes Texas government data sources on demand.

  • 764 data sources across dozens of state agencies
  • REST API: POST /run with source codes, poll for results
  • CLI — run individual scrapers from the command line
  • Multiple output formats — CSV, JSON, or PostgreSQL
  • Docker Compose — one command to deploy the full stack
  • 12 months of updates — when agencies change their sites, we update the scrapers
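To make the "POST /run, poll for results" flow concrete, here is a minimal Python sketch using only the standard library. Only the POST /run route and the idea of passing source codes come from the description above. The base URL, the JSON field names (sources, format, job_id, status), the status values, and the /jobs/<id> polling path are all assumptions made for illustration, so check them against the documentation that ships with the kit.

```python
import json
import time
import urllib.request

# Assumptions (not taken from the kit's documentation): the host and port,
# the JSON field names, the status values, and the polling endpoint.
BASE_URL = "http://localhost:8000"  # hypothetical


def build_run_request(sources, output_format="json", base_url=BASE_URL):
    """Build a POST /run request carrying the source codes to scrape."""
    body = json.dumps({"sources": list(sources), "format": output_format})
    return urllib.request.Request(
        f"{base_url}/run",
        data=body.encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def send_json(request, timeout=60):
    """Send a request and decode the JSON response body."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


def poll_until_done(fetch_status, interval=2.0, max_attempts=30, sleep=time.sleep):
    """Call fetch_status() until it reports a terminal state or attempts run out."""
    for attempt in range(max_attempts):
        status = fetch_status()
        if status.get("status") in ("done", "failed"):  # hypothetical states
            return status
        if attempt < max_attempts - 1:
            sleep(interval)
    raise TimeoutError("job did not finish within the polling budget")


def run_and_wait(sources, base_url=BASE_URL):
    """Submit a run, then poll a hypothetical /jobs/<id> endpoint for the result."""
    job = send_json(build_run_request(sources, base_url=base_url))
    status_url = f"{base_url}/jobs/{job['job_id']}"  # hypothetical path
    return poll_until_done(lambda: send_json(urllib.request.Request(status_url)))
```

With the container running, a call such as run_and_wait(["sos_business", "tdlr_license"]) would submit both scrapers and block until the job reports a terminal state. The polling loop takes the status fetcher as a parameter, so it can be exercised without a live server.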

Verified Working Sources (v1.0)

Every scraper is tested against live government sites. Here's what's confirmed working in the initial release:

Source                 Agency                   Data
sos_business           Secretary of State       Business filings, entity search
tcad_property          Travis County Appraisal  Property records, valuations
tdlr_license           TDLR                     Professional license verification
rrc_wells              Railroad Commission      Oil/gas well data
ethics_campaign        Ethics Commission        Campaign finance disclosures
tceq_violations        TCEQ                     Environmental violations
rrc_gis                Railroad Commission      GIS pipeline/well mapping
rrc_bulk               Railroad Commission      Bulk production data
comptroller_franchise  Comptroller              Franchise tax filings
tec_campaign           Ethics Commission        Campaign finance (TEC API)

More modules ship weekly. Each update adds new agencies and fixes for site changes.

Who This Is For

  • Real estate investors — property records, tax data, business entity lookups at scale
  • Journalists — campaign finance, environmental violations, license verifications
  • Data analysts — structured, queryable data from sources that only offer clunky search forms
  • Researchers — bulk access to public records without manual clicking
  • Developers — a REST API you can integrate into your own pipelines

Why $13.37

Because charging $200/month for public records is rent-seeking. The data is public. The scraping is the hard part. We did the hard part and we're charging what a good lunch costs.

No subscriptions. No API keys to manage. No usage limits. It runs on your hardware, scrapes at your pace, stores data where you want it.

Twelve months of updates included. After that, your copy still works — you just stop getting new modules and site-change fixes.

Get It

Buy Texas Scraper Kit — $13.37

Questions? hello@c4573.org