Buffy coat specimens remain viable as a DNA source for highly multiplexed genome-wide genetic tests after long term storage

Mychaleckyj, Josyf C; Farber, Emily A; Chmielewski, Jessica; Artale, Jamie; Light, Laney S; Bowden, Donald W; Hou, Xuanlin; Marcovina, Santica M

doi:10.1186/1479-5876-9-91

Research
Open access
Published: 10 June 2011

Buffy coat specimens remain viable as a DNA source for highly multiplexed genome-wide genetic tests after long term storage

Josyf C Mychaleckyj¹,
Emily A Farber¹,
Jessica Chmielewski²,
Jamie Artale¹,
Laney S Light³,
Donald W Bowden⁴,
Xuanlin Hou¹ &
…
Santica M Marcovina²

Journal of Translational Medicine volume 9, Article number: 91 (2011) Cite this article

12k Accesses
29 Citations
3 Altmetric
Metrics details

Abstract

Background

Blood specimen collection at an early study visit is often included in observational studies or clinical trials for analysis of secondary outcome biomarkers. A common protocol is to store buffy coat specimens for future DNA isolation and these may remain in frozen storage for many years. It is uncertain if the DNA remains suitable for modern genome wide association (GWA) genotyping.

Methods

We isolated DNA from 120 Action to Control Cardiovascular Risk in Diabetes (ACCORD) clinical trial buffy coats sampling a range of storage times up to 9 years and other factors that could influence DNA yield. We performed TaqMan SNP and GWA genotyping to test whether the DNA retained integrity for high quality genetic analysis.

Results

We tested two QIAGEN automated protocols for DNA isolation, preferring the Compromised Blood Protocol despite similar yields. We isolated DNA from all 120 specimens (yield range 1.1-312 ug per 8.5 ml ACD tube of whole blood) with only 3/120 samples yielding < 10 ug DNA. Age of participant at blood draw was negatively associated with yield (mean change -2.1 ug/year). DNA quality was very good based on gel electrophoresis QC, TaqMan genotyping of 6 SNPs (genotyping no-call rate 1.1% in 702 genotypes), and excellent quality GWA genotyping data (maximum per sample genotype missing rate 0.64%).

Conclusions

When collected as a long term clinical trial or biobank specimen for DNA, buffy coats can be stored for up to 9 years in a -80degC frozen state and still produce high yields of DNA suitable for GWA analysis and other genetic testing.

Trial Registration

The Action to Control Cardiovascular Risk in Diabetes (ACCORD) trial is registered with ClinicalTrials.gov, number NCT00000620.

Background

Clinical trials and prospective observational cohort studies are complex to design and costly to implement, hence there is a strong desire to maximize overall clinical and scientific return on investment. A common strategy is to include blood specimen collection at a baseline or early participant study visit to enable future ancillary studies or analysis of secondary biomarker outcomes. The blood specimens may be processed to produce aliquots of sera, plasma, or blood cell pack that are stored frozen for future use. For genetics studies, DNA is more stable under long-term freezer storage, but in many existing or completed studies, the study protocol required the extraction and storage of buffy coats (aliquots of white blood cell pack) [1, 2]. The Action to Control Cardiovascular Risk in Diabetes (ACCORD) clinical trial is one such study that banked buffy coat specimens for future use in genetic ancillary studies. Several studies have demonstrated a decreased DNA yield with frozen storage over time [3–5].

The ACCORD trial was a randomized, multicenter, double 2 × 2 factorial design which recruited 10,251 type 2 diabetes patients that were randomized to glycemic interventions, of which 5,518 were randomized to lipid interventions in one 2 × 2 trial and 4,733 randomized to blood pressure interventions in the second 2 × 2 trial [6, 7]. The trial was designed to test the effects on major cardiovascular disease events of intensive glycemia control, treatment to increase HDL-cholesterol and lower triglycerides, and intensive blood pressure control (in the context of good glycemia and LDL control). Recruitment occurred in two phases, January - June 2001 (Vanguard Phase, N = 1,174), and February 2003-October 2005 (N = 9,077) [8]. Participants were recruited, randomized, treated, and followed through a system of seven Clinical Center Networks (CCNs). Each CCN consisted of a network of collaborating clinical sites.

The trial protocol required clinics to collect a single 8.5 ml Acid Citrate Dextrose (ACD) tube of whole blood from trial participants who consented to the use of a blood specimen for future genetics studies. The specimens were refrigerated, shipped, and processed to yield buffy coats, which were stored frozen at -80 degC. These specimens have been in storage for variable periods of time, and up to 9 years for the trial Vanguard phase participants. It was unclear whether they had degraded significantly and were no longer viable for modern highly multiplexed GWA genotyping assays that simultaneously genotype 1 million SNPs and CNV probes (or more). We designed this study to answer two specific questions:

1.
What was the total yield of DNA that could be expected from the buffy coat specimens collected and stored under the ACCORD trial protocol?
2.
Was the isolated DNA still of sufficient quality to provide a substrate for multiplex GWA genotyping?

We isolated the DNA from 120 ACCORD trial buffy coat specimens selected from all 8 sub-arms of the trial to sample a range of storage times and other study factors that could predict total DNA yield. We performed individual SNP genotyping on aliquots from all 120 specimens and a GWA genotyping assay on 32 of the 120 to test whether the isolated DNA has retained molecular integrity for high-quality GWA study analysis.

Methods

Buffy Coat Specimen Collection and Storage

One 8.5 ml ACD tube of whole blood was collected from ACCORD participants during their baseline trial visit and refrigerated at 4 degC at the recruitment clinic until shipment. Institutional Review Board approval was obtained from all recruitment, laboratory, or data management sites and written informed consent was obtained from study subjects. The shipping protocol required the clinics to ship the blood tube on cold pack refrigerant to the ACCORD Central Laboratory on the same day as collection by overnight courier (within 24 hours). All buffy coats, without exception, were extracted on the day of receipt at the Central Laboratory. Processing and storage of the buffy coat fraction (white blood cell layer) was performed following the recommendations of the NHLBI Working Group [1]. Briefly, the ACD tube was centrifuged at 2000 rpm for 30 mins and the plasma removed. Using a sterile transfer pipette, the buffy coat layer was transferred to a sterile barcoded cryovial and placed on ice. An equal volume of cell freezing solution (99% glycerol, 50 nM sodium citrate, 20 mM sodium phosphate monobasic, monohydrate, and 20 mM sodium phosphate, dibasic, anhydrous) was added and the cells suspended by gentle rocking. The cryovial containing the cells was immediately transferred to a -80 degC REVCO Ultima freezer (Thermo Fisher Scientific Inc., Waltham, MA) with audible/visual warning for power failure and temperature deviation beyond set points. The freezer was monitored by a 24 hr alarm monitoring company, with monthly testing for alarm system operation. The freezer was connected to a 100 KW on site backup generator with automatic failover in case of a power outage.

Prior to DNA isolation, 120 stored frozen buffy coat specimens were randomly selected by the ACCORD Coordinating Center out of 6,008 participants who had consented to the broadest categories of genetics study usage of their specimen. The specimens were sampled with even distribution across a range of blood sample storage duration and trial assignment. There were fewer specimens available for selection in the period 2006-2009 than from earlier years (none were drawn in 2002). An equal number of specimens were selected from each of 5 time periods (2001, 2003, 2004, 2005, and 2006-2009) to sample a range of storage durations. No more than 3 participants were selected per clinic site. The specimen characteristics are shown in Table 1. The 120 sampled individuals are representative of the overall trial pool: 35% female (39% trial); 64% white (62% trial); 19% African American (19% trial); mean age at draw 63.5 years (62.2 years trial).

Table 1 Clinical characteristics of the stored buffy coat samples selected for DNA isolation, N = 120 total samples

Full size table

DNA Isolation Protocol

Frozen buffy coat specimens were shipped to University of Virginia Center for Public Health Genomics for DNA isolation and GWA genotyping. DNA was isolated using automated purification protocols on a QIAGEN^® Autopure LS^®. Two initial test runs of 8 samples each were performed to compare candidate isolation protocols:

1.
QIAGEN^® Automated purification of DNA from fresh or frozen buffy coat on the Autopure LS^® (protocol version AP03 Nov-07, up to 10 ml sample)
2.
QIAGEN^® Automated purification of DNA from compromised blood samples on the Autopure LS^® (protocol version AP06 Nov-07, up to 10 ml sample).

(Protocol documents are available at http://www.qiagen.com). According to the vendor description, protocol 1. for fresh or frozen buffy coats is applicable for samples frozen at -80 degC directly after collection and stored for less than 2 years at this temperature. The main differences between the protocols are that the compromised blood protocol 2 dispenses additional RBC lysis reagent (40 ml versus 35 ml total volume) and incubates for 30 seconds longer during lysis; uses 4 ml protein precipitation solution versus 3.34 ml during WBC lysis/protein precipitation step and centrifuges at 3000 g for 5 min versus 2 mins; centrifuges for 5 mins versus 2 mins at 3000 g during initial DNA precipitation step; and during DNA wash, uses 12 ml 70% ethanol versus 10 ml, and centrifuges for 5 min versus 1 min at 3000 g after alcohol wash to re-precipitate the DNA. Since the results from the compromised blood protocol 2 were superior, the compromised blood protocol was used for the remaining 104 samples. The remaining samples were processed in runs of 16 samples at a time.

DNA Quality Control

After isolation and purification, the DNA was quantitated on a NanoDrop 8000, to measure concentration and assess the purity of the DNA through standard A260/A280 and A260/A230 ratios. The DNA was diluted to 400 ul total, except where yields were lower. A 3 ul aliquot of the DNA solution was evaluated for DNA length distribution and potential degradation by electrophoresis on a 1% agarose gel against a molecular weight ladder with ethidium bromide staining.

SNP Genotyping QC

One hundred and seventeen DNA specimens were tested for success in genotyping individual SNPs using Applied Biosystems TaqMan^® assays. Three (3/120) samples were not genotyped due to low total DNA yield (1.07 ug, 4.00 ug, 5.22 ug) and the need to preserve the DNA for future disease genetics studies. The TaqMan genotyping assay is a QC test of the suitability of the isolated DNA for single SNP genotyping. Failure in this step indicates that the DNA quality is unlikely to be sufficient for highly multiplexed GWA genotyping. A limited panel of 6 high heterozygosity autosomal SNPs located on different human chromosomes was selected for this purpose. Applied Biosystems TaqMan^® Genotyping Assay Protocol (Part Number 4332856 Rev. C 05/2006) was used to genotype the SNPs on an Applied Biosystems 7900HT Fast Real-Time PCR System using standard reagents and standard cycling protocols. The SNPs are listed in Table 2.

Table 2 SNP panel composition and genotyping results

Full size table

GWA Genotyping Assay

Thirty two specimens were randomly pre-selected by the ACCORD Coordinating Center for GWA genotyping before DNA isolation results were available. After DNA isolation 5 of these were found to have total DNA yield < 50 ug. To conserve these for future analysis, we substituted 5 higher yield specimens (yield > 50 ug) that matched the substituted specimen characteristics as far as possible with respect to year, gender, race (4/5 matches), and recruiting CCN. The 32 samples were genotyped on 8 Illumina Human Omni1-Quad beadchips, each beadchip assaying four samples for 1,140,419 SNPs and CNV probes. A minimum of 200 ng of DNA is required per sample http://www.illumina.com/documents/products/datasheets/datasheet_humanomni1_quad.pdf. The genotyping assay was performed according to the standard Illumina Infinium HD Super Assay protocol (Infinium HD Super Assay Protocol Guide, Catalog #WG-901-4002 Part# 11322427 Rev.B).

GWA Genotyping Assay QC

The quality of the GWA genotyping data was assessed using the vendor built-in positive and negative quality control steps in the Illumina GenomeStudio software suite. Seven GWA genotyping assay controls included with every Illumina Infinium HD array monitor amplification, hybridization, extension, stripping, and staining which are assessed using the GenCall dashboard [9]. These were visually inspected for all sample GWA assays http://www.illumina.com/software/genomestudio_software.ilmn. The Gentrain2 algorithm was used for SNP quality scoring and the genotypes were also curated according to standard vendor genotyping QC protocol. Since only 32 samples were clustered, the standard cluster file (HumanOmni1-Quad_v1-0_B.egt) was used as per vendor recommendations for projects with less than 100 samples (Illumina Technote "Infinium Genotyping Data Analysis" http://www.illumina.com/Documents/products/technotes/technote_infinium_genotyping_data_analysis.pdf. SNP curation was performed following recommendations in the same document. This protocol identifies SNPs that should be manually reviewed by an experienced technician. All genotypes for poorly performing SNPs were set to missing. A separate cluster analysis was performed for X chromosome SNPs.

GWA Statistical Genotype QC Analysis

After the genotyping laboratory QC was complete, data was exported from Illumina GenomeStudio for additional QC and statistical analysis. This QC mirrored the standard steps used for genotype data QC in many GWA studies to control the type 1 error rate associated with multiple testing of many thousands of SNPs [10, 11]. The statistics included genotype missing rates by sample and by SNP.

Results

Isolated DNA Yields

We were able to isolate DNA from all 120 buffy coat specimens, with varying total yield. Since the buffy coat specimens had been in storage for range of durations up to 9 years, we tested two automated DNA purification protocols on a QIAGEN Autopure LS, 1) fresh or frozen buffy coat and 2) compromised blood sample. We compared the two protocols by comparing the yield and assay performance on a randomly selected subset of 8 samples for each protocol. We found no significant difference in the mean yield between the first subset, isolated using the buffy coat protocol, and second compromised blood protocol subset, mean yield (+/-sem) 139.3 +/- 9.0 ug and 162.5 +/- 9.8 ug respectively (Welch t-test p = 0.55); or between the first 8 and the remaining 112 isolated using the Compromised Blood Protocol, mean yield 139.3 +/-9.0 ug and 134.4 +/-0.6 ug (p = 0.86). However the Buffy Coat protocol group of 8 samples appeared to contain protein contamination after purification, did not rehydrate well, and had to be re-purified manually. The second group showed clean pellets and dissolved into solution without difficulty, hence we chose this protocol for automated purification of the rest of the samples.

The distribution of total yield from all 120 samples is shown in Figure 1. For the 112 Compromised Blood Protocol specimens, the range of yields was 1.1-312.2 ug. Thirteen samples (11.6%) yielded < 50 ug, while 3 samples (2.67%) produced a yield of < 10 ug of DNA. For all 120 samples including the 8 isolated by Buffy Coat protocol, 14 yielded total DNA < 50 ug (11.7%), 4 samples yielded < 10 ug (3.3%), and 111 (92.5%) had sufficient yield to dilute into 400 ul total for future DNA stock solution (minimum required concentration 100 ng/ul). The lowest yield samples were diluted into 50 ul total stock to allow for multiple future aliquots but with variable lower concentrations. The mean yield for all 120 samples was 134.7 ug +/-0.6 ug and median was 130.6 ug.

To investigate the effect of study or participant factors on the yield, we tested linear regression models of total DNA yield. Figure 2 shows the yield for the different years of collection. We dropped 3 samples collected in 2008 because of insufficient cases, and recoded Asian (N = 5) and Hispanic (N = 7) samples as race "Other", ie non-White or African-American, giving White = 75, African-American = 22, Other = 20 (N = 5 Asian + 7 Hispanic + 8 coded Other in trial). Results for multivariate analysis of total DNA yield are shown in Table 3. There was no marginal association of yield with race (p = 0.16), gender (p = 0.3), clinical center network (p = 0.28), or lab receipt time (p = 0.16) after adjustment for other factors. However in the same model, year of blood collection was negatively associated with yield (beta = -11.6 ug per year +/- 4.2, p = 0.009), and age at collection was also negatively associated (beta = -2.1 ug per year +/- 0.8, p = 0.015). After dropping the 2001 specimens, year of collection was no longer significant (beta = -0.7+/- 8.3 ug, p = 0.9). Collectively these factors explain 23.5% of the total yield variance (F_12,104 = 2.66, p = 0.004). We discuss the surprising dependency of total yield on year of collection below.

Table 3 Combined linear regression and analysis of variance results for predictor variables of the total DNA yield (ug)

Full size table