This dataset (data extraction spreadsheet) is associated with an exploratory evaluation of pharmaceutical industry clinical study reports (CSR) for possible use in evidence synthesis and systematic reviews. 78 CSRs from public sources were selected for data extraction. The report dates range from 1991 through 2011, inclusive, and represent 90 randomized controlled trials of 14 pharmaceuticals. The primary outcome measures include presence and length of essential elements of trial design and reporting and compression factor (ratio of page length for CSRs compared to its published counterpart in a scientific journal). The dataset is comprised of an audited table of extracted and derived variables the details for which are described in an accompanying readme file. The uncorrected (original) and corrected extraction sheets as well as audit records are available upon request from Peter Doshi, corresponding author (firstname.lastname@example.org).