Updated February 8, 2018
Downloading data from this page implies that agreement with the following restrictions:
- These data are not to be used for commercial or for-profit use.
- These data may not be sold or otherwise restricted to others.
- These data may be used only for research, academic or scholarly pursuits to advance knowledge in related fields.
- George Mason University and Jennifer Victor are not responsible for violations of these intended usages of these data.
- Please cite your usage of these data using the following citation:
Victor, Jennifer Nicoll. 2018. “NAME OF DATASET.” Data available for scholarly use from https://jvictor.gmu.edu/index.php/data-download/ DATE ACCESSED.”
If you find errors, please communicate them to firstname.lastname@example.org. I will post notes and updated data as time permits.
U.S. Congress Caucus Data
For complete caucus membership data from the 103rd to 113th congresses, please visit: http://bridgeinfogap.org/database/
The files called “membership_xxx.csv” are incidence matrices where the first column indicates the MC identifier and the first row indicates the caucus identifier. Values are dichotomous, where 1= caucus membership.
The files called “leadership_xxx.csv” are incidence matrices where the first column indicates the MC identifier and the first row indicates the caucus identifier. Values are dichotomous, where 1 = caucus leader.
The files called “Caucuses_xxx.csv” contain two columns. The first column indicates the caucus identifier. The second column indicates the caucus name, a string. The first row has column labels.
The following files were created with assistance from Justin Kirkland. The bill data were scraped from Thomas. Matrices are incidence files with MCs as rows (with IDs in the first column) and bills in columns. 1 = sponsorship, 2 = cosponsorship. Some MCs are removed from these files so that IDs and MCs match caucus data. See notes. Files are in comma separated values (.csv) format. As of now, only House data are included.
These data were compiled with assistance from Steven Haptonstahl and Justin H. Gross. Data below are Excel files (.xlsx) that include square adjacency matrices of co-voting for the 103rd-111th congresses, with legislator IDs. Legislators are coded as 1 if they voted “Yea” or were recorded as an “Announced Yea” vote (originally, 1 and 3, respectively in raw roll call data). Legislators are coded as -1 if they voted “Nay” or “Announced Nay” (originally 6 and 4 in raw data, respectively). Those voting “Present” (7) or “Not Voting” (9) are coded as 0.
- Calculate the sum (S) of all the dyadwise agreements (1s) and disagreements (-1).
- Calculate the sum of the absolute values of all votes (V), which returns the number of roll calls in which both members of the dyad participated.
- Calculate the number of times the dyad cast the same vote (A): (S+V)/2
- Calculate the rate of agreement for each dyad, given that both voted: A / V
The matrices report the rate of voting agreement for each dyad in each congress across all roll call votes for the House.