Question 1

What data sources does CivicAlign use?

Accepted Answer

CivicAlign pulls primarily from Congress.gov (bills and roll-call votes), FEC.gov (federal campaign finance), state campaign finance agencies (Governor and state-level races), GovInfo.gov (committee reports), and members' official websites (for public statements). We do not use Twitter/X, news aggregators, or third-party tracking sites as primary sources.

Question 2

How are AI bill summaries generated?

Accepted Answer

New bills are pulled daily from Congress.gov. Long bill text is broken into sections and passed through an LLM with instructions to extract operative provisions, not editorialize. Every summary carries the Congress.gov URL it was generated from. Summaries appear under labels like "AI summary" or "AI title" with a visible link to the source.

Question 3

What's human-reviewed vs. AI-generated?

Accepted Answer

Bill text, vote tallies, sponsor names, and campaign finance figures are human-reviewed (sourced verbatim from Congress.gov, FEC, and state agencies). Plain-English bill summaries, AI titles, said-vs-voted contradiction analyses, and stance scores are AI-generated. Where a piece of content is AI-generated, the surface says so.

Question 4

How often does CivicAlign update its data?

Accepted Answer

New bills and roll-call votes from Congress.gov refresh daily. FEC campaign finance refreshes nightly via bulk import. State campaign finance refreshes nightly when the state publishes structured data. AI bill summaries are generated on first ingest and regenerated only when the bill text changes. Public statements are pulled ad hoc, generally weekly.

Question 5

Is Governor campaign finance sourced from the FEC?

Accepted Answer

No. Governor finance is never sourced from the FEC because the FEC does not track state races. Each state has its own campaign finance agency (e.g., Texas Ethics Commission, Florida Division of Elections), and CivicAlign attributes Governor finance data to that specific state agency.

Question 6

How is the bill-passage probability calculated?

Accepted Answer

Every bill carries a passage-probability badge from a two-engine system: a rule-based heuristic anchored to historical base rates (about 4% for newly introduced bills, 30% for bills reported out of committee, 55% for bills passed in one chamber, 95% for bills sent to the President — calibrated against GovTrack/CRS analyses of the 113th-118th Congresses), and an ML.NET LightGBM model trained on 31,244 resolved bills with 0.96 AUC, 0.61 F1, and 0.95 accuracy on a held-out test set. The model uses structural features (cosponsorship, committees, amendments, policy area, Congress era) plus a 128-dim semantic embedding of the bill's AI-generated summary. We take whichever engine produces the higher probability per bill and label the source in the tooltip. Predictions above 80% on freshly-introduced bills should be treated with skepticism — at that stage the base rate is ~4%.

Question 7

What are the known limitations of CivicAlign?

Accepted Answer

State campaign finance coverage is uneven (some states do not publish structured data). Said-vs-voted analyses require a recorded floor vote. AI summaries are statistical paraphrase, not legal interpretation. Stance scoring uses a recent window of roll-call history. CivicAlign does not track state legislatures — only the U.S. Senate, U.S. House, Governor races, and the federal corpus on Congress.gov.

Question 8

How do I report a mistake?

Accepted Answer

File a correction at /corrections. We review every submitted correction within five business days and log resolved corrections publicly on that page.

Surface	Human-reviewed	AI-generated
Bill text, vote tallies, sponsor names	Yes (verbatim from Congress.gov)	—
Campaign finance figures	Yes (verbatim from FEC / state agencies)	—
Plain-English bill summaries	—	Yes
One-sentence bill titles ("AI titles")	—	Yes
Said-vs-voted contradiction analyses	—	Yes
Stance scores ("strongly for / mixed / against")	—	Yes (derived from vote records)
Hero copy, navigation labels, FAQs	Yes	—

Data	Refresh interval
New bills from Congress.gov	Daily
Roll-call vote records	Daily
AI-generated bill summaries	Generated on first ingest; regenerated only if the bill text changes
FEC campaign finance	Nightly (bulk import)
State campaign finance	Nightly, when the state publishes structured data
Public statements (said-vs-voted)	Ad hoc, generally weekly

Stage	Base passage rate
Introduced	~4%
In committee	~5%
Reported out of committee	~30%
Floor debate	~42%
Passed one chamber	~55%
In conference	~75%
Sent to the President	~95%

Metric	Value	What it means
AUC	0.96	Given any pair of (passed, failed) bills, the model ranks the passed one higher 96% of the time.
F1	0.61	Harmonic mean of precision and recall on the positive class (becomes-law). Hard to push higher because positives are only ~5% of bills.
Accuracy	0.95	Fraction of all predictions that match the ground truth, on the held-out test set.

How CivicAlign works — and where it doesn't.

Data sources

How summaries are generated

What's human-reviewed vs. AI-generated

Update cadence

How the bill-passage prediction works

Base rates we calibrate against

Adjustments the heuristic layers on top

What the ML model learns

What the prediction will not do

Verifying the model is calibrated

Known limitations

Found a mistake?