<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
	<ui>jbiol11</ui>
	<ji>1475-4924</ji>
	<fm>
		<dochead>Research article</dochead>
		<bibl>
			<title>
				<p>Adaptive evolution of centromere proteins in plants and animals</p>
			</title>
			<aug>
				<au id="A1">
					<snm>Talbert</snm>
					<mi>B</mi>
					<fnm>Paul</fnm>
					<insr iid="I1"/>
					<email>ptalbert@fhcrc.org</email>
				</au>
				<au id="A2">
					<snm>Bryson</snm>
					<mi>D</mi>
					<fnm>Terri</fnm>
					<insr iid="I1"/>
				</au>
				<au id="A3" ca="yes">
					<snm>Henikoff</snm>
					<fnm>Steven</fnm>
					<insr iid="I1"/>
					<email>steveh@fhcrc.org</email>
				</au>
			</aug>
			<insg>
				<ins id="I1">
					<p>Howard Hughes Medical Institute, Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue N, Seattle, WA 98109-1024, USA</p>
				</ins>
			</insg>
			<source>Journal of Biology</source>
			<issn>1475-4924</issn>
			<pubdate>2004</pubdate>
			<volume>3</volume>
			<issue>4</issue>
			<fpage>18</fpage>
			<url>http://jbiol.com/content/3/4/18</url>
			<xrefbib>
				<pubidlist><pubid idtype="pmpid">15345035</pubid><pubid idtype="doi">10.1186/jbiol11</pubid>
				</pubidlist></xrefbib>
		</bibl>
		<history>
			<rec>
				<date>
					<day>25</day>
					<month>5</month>
					<year>2004</year>
				</date>
			</rec>
			<revrec>
				<date>
					<day>20</day>
					<month>7</month>
					<year>2004</year>
				</date>
			</revrec>
			<acc>
				<date>
					<day>22</day>
					<month>7</month>
					<year>2004</year>
				</date>
			</acc>
			<pub>
				<date>
					<day>31</day>
					<month>8</month>
					<year>2004</year>
				</date>
			</pub>
		</history>
		<cpyrt>
			<year>2004</year>
			<collab>Talbert et al.; licensee BioMed Central Ltd.</collab>
			<note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
		</cpyrt>
		<abs>
			<sec>
				<st>
					<p>Abstract</p>
				</st>
				<sec>
					<st>
						<p>Background</p>
					</st>
					<p>Centromeres represent the last frontiers of plant and animal genomics. Although they perform a conserved function in chromosome segregation, centromeres are typically composed of repetitive satellite sequences that are rapidly evolving. The nucleosomes of centromeres are characterized by a special H3-like histone (CenH3), which evolves rapidly and adaptively in <it>Drosophila </it>and <it>Arabidopsis</it>. Most plant, animal and fungal centromeres also bind a large protein, centromere protein C (CENP-C), that is characterized by a single 24 amino-acid motif (CENPC motif).</p>
				</sec>
				<sec>
					<st>
						<p>Results</p>
					</st>
					<p>Whereas we find no evidence that mammalian CenH3 (CENP-A) has been evolving adaptively, mammalian CENP-C proteins contain adaptively evolving regions that overlap with regions of DNA-binding activity. In plants we find that CENP-C proteins have complex duplicated regions, with conserved amino and carboxyl termini that are dissimilar in sequence to their counterparts in animals and fungi. Comparisons of <it>Cenpc </it>genes from <it>Arabidopsis </it>species and from grasses revealed multiple regions that are under positive selection, including duplicated exons in some grasses. In contrast to plants and animals, yeast CENP-C (Mif2p) is under negative selection.</p>
				</sec>
				<sec>
					<st>
						<p>Conclusions</p>
					</st>
					<p>CENP-Cs in all plant and animal lineages examined have regions that are rapidly and adaptively evolving. To explain these remarkable evolutionary features for a single-copy gene that is needed at every mitosis, we propose that CENP-Cs, like some CenH3s, suppress meiotic drive of centromeres during female meiosis. This process can account for the rapid evolution and the complexity of centromeric DNA in plants and animals as compared to fungi.</p>
				</sec>
			</sec>
		</abs>
	</fm>
	<bdy>
		<sec>
			<st>
				<p>Background</p>
			</st>
			<p>Centromeres are the chromosomal loci where kinetochores assemble to serve as attachment sites for the spindle microtubules that direct chromosome segregation during mitosis and meiosis. Despite this essential conserved function in all eukaryotes, centromere structure is highly variable, ranging from the simple short centromeres of budding yeast, which have a consensus sequence of approximately 125 base pairs (bp) on each chromosome, to holokinetic centromeres that span the entire length of a chromosome <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. In plants and animals, centromeres are large and complex, typically comprising megabase-sized arrays of tandemly repeated satellite sequences that are rapidly evolving <abbrgrp><abbr bid="B2">2</abbr></abbrgrp> and may differ significantly between closely related species <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp>. The failure of conventional cloning and sequencing assembly tools to adequately characterize rapidly evolving satellite sequences at centromeres has made them the last regions of most eukaryotic genomes to be well understood <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>.</p>
			<p>Although there is no discernable conservation of centromeric DNA sequences in disparate eukaryotes, considerable progress has been made in identifying common proteins that form the kinetochore <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. A universal protein component of centromeric chromatin found in all eukaryotes that have been examined is a centromere-specific variant of histone H3 (CenH3), which replaces canonical H3 in centromeric nucleosomes <abbrgrp><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr></abbrgrp>. CenH3s are essential kinetochore components yet, like centromeric DNA, they are rapidly evolving <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. In both <it>Drosophila </it><abbrgrp><abbr bid="B9">9</abbr></abbrgrp> and <it>Arabidopsis </it><abbrgrp><abbr bid="B10">10</abbr></abbrgrp>, this rapid evolution of CenH3s is associated with positive selection (adaptive evolution), and involves regions of CenH3 that are predicted to contact the centromeric DNA <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr></abbrgrp>.</p>
			<p>The finding of positive selection in a protein that is required at every cell division is remarkable. Ancient proteins with conserved function are expected to be under negative selection because they typically have achieved an optimal sequence, so new mutations tend to produce deleterious variants that are quickly eliminated from populations. The canonical histones are extreme examples of this type of protein. In contrast, recurrent positive selection generally occurs as a consequence of genetic conflict, for example in the 'arms race' between pathogen surface antigens and the immune-cell proteins that recognize them. In this case, a mutation in a surface antigen that allows the pathogen to escape detection and proliferate will trigger selection for a new immune receptor to fight the mutated pathogen, which can then mutate again, and so on. The evidence for positive selection of CenH3 proteins specifically in the regions that contact DNA thus suggests a conflict between centromeric DNA and a histone component of the nucleosome that packages it. Is it commonplace for eukaryotes to have such a conflict at their centromeres? Is the conflict unique to centromere-specific histones, or are other proteins that bind centromeres also involved in this conflict? Is conflict responsible for centromere complexity? To answer these questions, we investigated the evolution of a second common DNA-binding kinetochore protein.</p>
			<p>Of the handful of essential kinetochore proteins that are widely distributed among eukaryotes, only one class other than CenH3 has been shown to bind centromeric DNA: centromere protein C (CENP-C), a conserved component of the inner kinetochore in vertebrates <abbrgrp><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr></abbrgrp>. Human CENP-C binds DNA non-specifically <it>in vitro </it><abbrgrp><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr></abbrgrp> and binds centromeric alpha satellite DNA <it>in vivo </it><abbrgrp><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr></abbrgrp>. Vertebrate CENP-C and the yeast centromere protein Mif2p <abbrgrp><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr></abbrgrp> share a 24 amino-acid motif (CENPC motif) that has also been found in kinetochore proteins in nematodes <abbrgrp><abbr bid="B24">24</abbr></abbrgrp> and plants <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. As expected for kinetochore proteins, disruption or inactivation of genes encoding proteins containing a CENPC motif (CENP-Cs) results in the failure of proper chromosome segregation <abbrgrp><abbr bid="B16">16</abbr><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr><abbr bid="B28">28</abbr></abbrgrp>.</p>
			<p>Other than the defining CENPC motif, these proteins are dissimilar in sequence across disparate phyla. Such a small stretch of sequence conservation, accounting for less than 5% of the length of these 549-943 amino-acid proteins, is unexpected considering that CENP-Cs are encoded by essential single-copy genes that are expected to be subject to strong negative selection. We therefore wondered whether the same evolutionary forces responsible for the rapid evolution of CenH3s cause divergence of CENP-Cs outside of the CENPC motif.</p>
			<p>Here, we describe coding sequences from several unreported <it>Cenpc </it>genes and test whether <it>Cenpc </it>genes are in general, like <it>CenH3 </it>genes, subject to positive selection. We find evidence for adaptive evolution of CENP-C in plants and animals, but we find negative selection in yeasts. Our results provide support for a meiotic drive model of centromere evolution.</p>
		</sec>
		<sec>
			<st>
				<p>Results and discussion</p>
			</st>
			<sec>
				<st>
					<p>CenH3s evolve under negative selection in some lineages</p>
				</st>
				<p>Previous work has shown that CenH3s are evolving adaptively in <it>Drosophila </it>and <it>Arabidopsis </it><abbrgrp><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr></abbrgrp>, but their mode of evolution in mammals is not known. Selective forces acting on proteins can be measured by comparing the estimated rates of nonsynonymous nucleotide substitution (K<sub>a</sub>) and synonymous substitution (K<sub>s</sub>) between coding sequences from closely related species. These rates are expected to be equal if the coding sequences are evolving neutrally (K<sub>a</sub>/K<sub>s </sub>= 1). Negative selection is indicated by K<sub>a</sub>/K<sub>s </sub>&lt; 1, and positive selection is indicated by K<sub>a</sub>/K<sub>s </sub>&gt; 1.</p>
				<p>To obtain a pair of closely related mammalian CenH3s, we used the sequence of the mouse (<it>Mus musculus</it>) CenH3, CENP-A <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>, to query the High Throughput Genomic Sequences portion of the GenBank database <abbrgrp><abbr bid="B30">30</abbr></abbrgrp> with a tblastn search, and identified a rat (<it>Rattus norvegicus</it>) genomic clone (AC110465) that contains the predicted rat CENP-A coding sequence. The predicted CENP-A protein is encoded in four exons and is 87% identical in amino-acid sequence to mouse CENP-A, excluding a 25 amino-acid insertion that appears to derive from a duplication of the amino terminus (Figure <figr fid="F1">1</figr>). This gene model is partially supported by an expressed sequence tag (EST; BF561223) that includes the first three exons, but which terminates in the predicted intron 3.</p>
				<fig id="F1">
					<title>
						<p>Figure 1</p>
					</title>
					<caption>
						<p>The rat CENP-A protein</p>
					</caption>
					<text>
						<p>The rat CENP-A protein. <b>(a) </b>Alignment of predicted CENP-A proteins of mammals. Relative to other mammalian CENP-As, rat CENP-A has a 25 amino-acid insertion that arises from a duplication of the amino terminus, shown as over-lined regions. The boundary between the tail and the histone-fold domains (HFD) is indicated below the alignment, along with the position of Loop 1. <b>(b) </b>Alignment of duplicated regions of the rat <it>Cenpa </it>gene (rat1 and rat2) with <it>Cenpa </it>genes of mouse and Chinese hamster. The region that became duplicated in rat extends from upstream of the start codon to codon 22 in mouse and hamster, and is bounded by a conserved dodecamer repeat. The encoded amino acids are shown above (rat1) or below (rat2) the duplicated sequence.</p>
					</text>
					<graphic file="jbiol11-1"/>
				</fig>
				<p>To determine whether <it>Cenpa </it>is evolving adaptively in rodents, we compared K<sub>a </sub>and K<sub>s </sub>between mouse and rat using K-estimator <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>. Positive selection in single-copy genes that are essential in every cell is expected to be localized and more difficult to detect than in nonessential genes or members of multigene families because of simultaneous negative selection to maintain their essential functions. In <it>Drosophila </it>and <it>Arabidopsis</it>, CenH3s are under positive selection in their tails, but also under negative selection in much of their histone-fold domains. We therefore used the sliding-window function of K-estimator to scan through the coding sequences using 99 bp windows every 33 bp in an effort to find regions of positive selection. This analysis detected statistically significant negative selection for all of the windows except one that failed to rule out neutrality, indicating that CENP-A is under negative selection (K<sub>a </sub>= 0.11, K<sub>s </sub>= 0.33; K<sub>a </sub>&lt; K<sub>s </sub>with <it>p </it>&lt; 0.001) in both the tail and the histone-fold domains. Similar results were obtained when comparing either sequence with the <it>Cenpa </it>gene from Chinese hamster (<it>Cricetulus griseus</it>) <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>, although the greater divergence (K<sub>s </sub>= 0.45 rat, 0.67 mouse) makes the statistical conclusion near the limit of reliability (K<sub>s </sub>&#8804; ~0.5) because of the increased likelihood of multiple substitutions. Thus, CENP-A appears to have been under negative selection throughout its length in multiple rodent lineages.</p>
				<p>We also compared the human <it>Cenpa </it>gene <abbrgrp><abbr bid="B33">33</abbr></abbrgrp> with the <it>Cenpa </it>gene from chimpanzee (<it>Pan troglodytes</it>). A blastn search of the Genome Sequencing Center's assembly of the chimpanzee genome <abbrgrp><abbr bid="B34">34</abbr></abbrgrp> using human <it>Cenpa </it>identified the chimp <it>Cenpa </it>gene encoded in four exons in Contig 286.218. We searched the NCBI trace archives <abbrgrp><abbr bid="B35">35</abbr></abbrgrp> to verify the sequence and the existence of appropriate putative intron splice sites. The predicted chimpanzee <it>Cenpa </it>gene differs from the human gene by six synonymous nucleotide substitutions and an indel (insertion or deletion) of two codons. This excess of synonymous substitutions indicates negative selection of CENP-A (<it>p </it>&lt; 0.01). Overall negative selection of CENP-A appears also to extend to the bovine (CB455530) protein, given the relatively high degree of conservation seen for all regions, including the tail and Loop 1 regions that evolve adaptively in <it>Drosophila </it>(Figure <figr fid="F1">1a</figr>).</p>
				<p>We also found overall negative selection in CenH3s of grasses. We used the <it>CENH3 </it>gene (AF519807) of maize (<it>Zea mays</it>) <abbrgrp><abbr bid="B36">36</abbr></abbrgrp> to search ESTs <abbrgrp><abbr bid="B37">37</abbr></abbrgrp> from sugarcane (<it>Saccharum officinarum</it>), and identified three that encode full-length <it>CENH3 </it>genes (CA119873, CA127217, and CA142604). The CenH3 proteins encoded by these ESTs differ from each other by 2-4 amino acids. Because sugarcane is thought to be octaploid, these variants may represent co-expressed homeologs. The coding regions of ESTs CA119873 and CA127217 differ by four synonymous and four nonsynonymous substitutions (K<sub>s </sub>= 0.03, K<sub>a </sub>= 0.01), suggesting negative selection. Comparison of either of these sequences with maize <it>CENH3 </it>by sliding-window analysis found that all windows had K<sub>s </sub>&gt; K<sub>a</sub>, with overall negative selection (K<sub>s </sub>= 0.24, K<sub>a </sub>= 0.13; <it>p </it>&lt; 0.01). Thus, in contrast to CenH3s in <it>Arabidopsis </it>and <it>Drosophila</it>, CenH3s of rodents, primates, and grasses appear not to be evolving adaptively.</p>
				<p>The evident lack of positive selection on CenH3 in mammals and grasses raises the possibility that another kinetochore protein is evolving in conflict with centromeric DNA in these organisms, in which centromeric satellite sequences are known to be evolving rapidly <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B38">38</abbr></abbrgrp>. We focused on CENP-C, which is found to co-localize with CenH3 to the inner kinetochore in humans <abbrgrp><abbr bid="B13">13</abbr></abbrgrp> and maize <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>.</p>
			</sec>
			<sec>
				<st>
					<p>Mammalian CENP-C is evolving adaptively</p>
				</st>
				<p>To address the possibility that CENP-C is adaptively evolving in mammals, we used the mouse sequence <abbrgrp><abbr bid="B14">14</abbr></abbrgrp> as a query in a tblastn search to identify <it>Cenpc </it>ESTs from rat. From these ESTs (see Additional data file 1), we obtained and sequenced a full-length cDNA (see Additional data file 2), and compared its coding sequence with that of the mouse <it>Cenpc </it>gene (68% predicted amino-acid identity). We found positive selection over most of the amino-terminal two-thirds of the coding sequence, interrupted by one region of significant negative selection (mouse codons 208-273), one region of nearly significant negative selection (mouse 410-464), and three short regions without significant selection (Figure <figr fid="F2">2a</figr>; Table <tblr tid="T1">1</tblr>). Most of the carboxy-terminal one-third of the protein, including the CENPC motif and an additional region that is homologous to the budding yeast CENP-C protein Mif2p <abbrgrp><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr></abbrgrp>, has been under negative selection. We conclude that at least some regions of <it>Cenpc </it>genes are evolving adaptively in rodents.</p>
				<fig id="F2">
					<title>
						<p>Figure 2</p>
					</title>
					<caption>
						<p>Sliding-window analysis of K<sub>a</sub>/K<sub>s </sub>for selected pairs of <it>Cenpc </it>genes</p>
					</caption>
					<text>
						<p>Sliding-window analysis of K<sub>a</sub>/K<sub>s </sub>for selected pairs of <it>Cenpc </it>genes. Each point represents the value of K<sub>s</sub>, K<sub>a</sub>, or K<sub>a</sub>/K<sub>s </sub>for a 99 nucleotide (33 codon) window plotted against the codon position of the midpoint of the window. K<sub>a</sub>/K<sub>s </sub>is not defined where K<sub>s </sub>= 0. The aligned coding sequence is represented at the top of each graph, with the CENPC motif represented by a filled rectangle; exons are also indicated for the plant sequences. Regions of statistically significant positive selection (black bars) and negative selection (gray bars) are marked. <b>(a) </b>Rat and mouse. The interrupted gray bar indicates that <it>p </it>= 0.06 for this region. <b>(b) </b><it>Arabidopsis thaliana </it>and <it>Arabidopsis arenosa</it>. <b>(c) </b>Maize (<it>CenpcA</it>) and <it>Sorghum bicolor</it>. <b>(d) </b>Wheat and barley, exons 9p-14.</p>
					</text>
					<graphic file="jbiol11-2"/>
				</fig>
				<tbl id="T1">
					<title>
						<p>Table 1</p>
					</title>
					<caption>
						<p>Pairwise comparison of mouse and rat <it>Cenpc </it>genes</p>
					</caption>
					<tblbdy cols="4">
						<r>
							<c ca="left">
								<p>Human</p>
							</c>
							<c ca="center">
								<p>Mouse</p>
							</c>
							<c ca="center">
								<p>Rat</p>
							</c>
							<c ca="center">
								<p>Selection</p>
							</c>
						</r>
						<r>
							<c cspan="4">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>1-86</p>
							</c>
							<c ca="center">
								<p>1-84</p>
							</c>
							<c ca="center">
								<p>1-77</p>
							</c>
							<c ca="center">
								<p>+*</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>109-248</p>
							</c>
							<c ca="center">
								<p>107-218</p>
							</c>
							<c ca="center">
								<p>100-236</p>
							</c>
							<c ca="center">
								<p>+**</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>239-304</p>
							</c>
							<c ca="center">
								<p>208-273</p>
							</c>
							<c ca="center">
								<p>226-291</p>
							</c>
							<c ca="center">
								<p>-**</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>294-353</p>
							</c>
							<c ca="center">
								<p>263-321</p>
							</c>
							<c ca="center">
								<p>281-335</p>
							</c>
							<c ca="center">
								<p>+*</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>411-455</p>
							</c>
							<c ca="center">
								<p>377-420</p>
							</c>
							<c ca="center">
								<p>391-434</p>
							</c>
							<c ca="center">
								<p>+**</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>445-497</p>
							</c>
							<c ca="center">
								<p>410-464</p>
							</c>
							<c ca="center">
								<p>424-478</p>
							</c>
							<c ca="center">
								<p>- (0.06)</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>487-552</p>
							</c>
							<c ca="center">
								<p>454-519</p>
							</c>
							<c ca="center">
								<p>468-533</p>
							</c>
							<c ca="center">
								<p>+**</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>565-670</p>
							</c>
							<c ca="center">
								<p>531-633</p>
							</c>
							<c ca="center">
								<p>545-643</p>
							</c>
							<c ca="center">
								<p>+**</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>671-790</p>
							</c>
							<c ca="center">
								<p>634-754</p>
							</c>
							<c ca="center">
								<p>644-764</p>
							</c>
							<c ca="center">
								<p>-**</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>858-934</p>
							</c>
							<c ca="center">
								<p>821-897</p>
							</c>
							<c ca="center">
								<p>831-907</p>
							</c>
							<c ca="center">
								<p>-*</p>
							</c>
						</r>
					</tblbdy>
					<tblfn>
						<p>+ denotes K<sub>a </sub>&gt; K<sub>s</sub>; -, K<sub>a </sub>&lt; K<sub>s</sub>; * <it>p </it>&lt; 0.05; ** <it>p </it>&lt; 0.01. Number ranges represent codon positions based on the complete coding sequences prior to removal of indels for alignment. Human codon positions are given for comparison with previous functional studies. Number in parentheses is a <it>p </it>value greater than 0.05.</p>
					</tblfn>
				</tbl>
				<p>To determine whether any of these regions is also under positive selection in primates, we identified the <it>Cenpc </it>gene of chimpanzee by using the human <it>Cenpc </it>coding sequence (GenBank accession number M95724) to search the assembled chimpanzee genome and the NCBI trace archives. We found that the chimpanzee genome contains a single copy of the <it>Cenpc </it>structural gene (contigs 375.88-375.100), as well as a processed <it>Cenpc </it>pseudogene (contigs 76.642-76.643), as has been found in humans <abbrgrp><abbr bid="B14">14</abbr><abbr bid="B18">18</abbr><abbr bid="B39">39</abbr></abbrgrp>. The predicted chimpanzee <it>Cenpc </it>coding sequence differs by 17 nucleotide substitutions from the human cDNA sequence, with K<sub>s </sub>= 0.0054 and K<sub>a </sub>= 0.0063. The &gt; 99% identity of the human and chimp coding sequences provides little opportunity to detect selection, but using sliding-window analysis we found a single region of significant positive selection (human codons 278-585) that overlaps the central regions of positive selection found in the more divergent rat-mouse comparison, indicating that the central portion of CENP-C is under positive selection in both rodents and primates.</p>
				<p>To confirm these results, we applied the codeml program of PAML <abbrgrp><abbr bid="B40">40</abbr></abbrgrp> to a multiple sequence alignment of mammalian CENP-Cs. PAML calculates the likelihood of models for neutral and adaptive evolution based on a tree and estimates K<sub>a</sub>/K<sub>s </sub>ratios. We compared the null model with two fixed site classes (K<sub>a</sub>/K<sub>s </sub>= 0 or 1) to a 'data-driven' model in which two classes of sites were estimated from the data. The data-driven model was found to be significantly more probable than the null model (&#967;<sup>2 </sup>= 8.7; <it>p </it>= 0.01) with K<sub>a</sub>/K<sub>s </sub>= 0.20 for 57% of the 685 sites in the multiple alignment and K<sub>a</sub>/K<sub>s </sub>= 1.64 for 43% of the sites (data not shown). Similar results were obtained using either a DNA- or a protein-based tree, or testing more complex models. When the same tests were applied to the core region of 11 aligned Brassicaceae (mustard family) CenH3s, only 17% of residues were estimated to be in the positive selection class (K<sub>a</sub>/K<sub>s </sub>= 2.54) (<abbrgrp><abbr bid="B11">11</abbr></abbrgrp> and data not shown), which indicates that positive selection on mammalian CENP-C has occurred more extensively than on CenH3s.</p>
				<p>Amino-acid sites of positive selection in mammalian CENP-Cs were identified as those with significant posterior probabilities. These were found to be scattered throughout the multiply aligned region with 5 of the 18 highly significant sites prominently clustered within 25 residues (human codons 424-448) in a region of positive selection identified by K-estimator analysis. Therefore, pairwise K-estimator and multiple PAML analyses yield similar results and reveal that large regions of mammalian CENP-Cs have been adaptively evolving.</p>
			</sec>
			<sec>
				<st>
					<p>Adaptively evolving regions overlap DNA-binding and centromere-targeting regions</p>
				</st>
				<p>The regions of positive selection in rodent and primate CENP-Cs overlap some protein landmarks identified in functional analyses of human CENP-C. The binding activity of human CENP-C to DNA <it>in vitro </it>has been mapped by two groups of investigators. Sugimoto and colleagues <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr></abbrgrp> found that the region including amino acids 396-498 bound DNA and was stabilized by including flanking amino acids on one or both sides (330-498 or 396-581; Figure <figr fid="F3">3a</figr>), suggesting that at least two regions in the central portion of the protein contribute to DNA binding. Yang and colleagues <abbrgrp><abbr bid="B19">19</abbr></abbrgrp> identified two non-overlapping DNA-binding regions: amino acids 23-440 and 459-943. They found a weak DNA-binding activity at the carboxyl terminus in region 638-943, which includes the CENPC motif (737-759) and the conserved Mif2p-homologous region (890-941). This suggests that region 459-943 itself contains at least two DNA-binding regions, a weak one at region 638-943, and a stronger one that may correspond to region 396-581 described by Sugimoto and colleagues. Both the central region and the carboxyl terminus have been shown to bind DNA <it>in vivo </it><abbrgrp><abbr bid="B21">21</abbr></abbrgrp>. Comparison of the regions of positive selection found in rodents and primates with these DNA-binding regions reveals extensive overlap with the central DNA-binding regions (Figure <figr fid="F3">3a</figr>), including the cluster of highly significant sites between codons 424 and 448 identified by PAML analysis. This is consistent with previous evidence that adaptive evolution of CenH3s occurs in regions that have been implicated in DNA binding <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B11">11</abbr></abbrgrp>. No positive selection was observed for the poorly mapped carboxy-terminal DNA-binding domain in our sliding-window analysis, suggesting either that this DNA-binding domain is not evolving adaptively or that strong negative selection on the CENPC motif can obscure detection by our sliding-window analysis of positive selection on nearby amino acids that contact centromeric DNA. In the DNA-binding Loop 1 region of <it>Arabidopsis </it>CenH3, adaptively evolving codons are found in close proximity to codons under strong negative selection <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>.</p>
				<fig id="F3">
					<title>
						<p>Figure 3</p>
					</title>
					<caption>
						<p>Comparisons of CENP-C proteins in animals, yeast and plants</p>
					</caption>
					<text>
						<p>Comparisons of CENP-C proteins in animals, yeast and plants. The CENPC motif and conserved regions found at the termini of CENP-C proteins are indicated. For pairwise comparisons of protein-coding sequences, regions of positive and negative selection between the species compared are shown. <b>(a) </b>Alignment of animal and fungal CENP-Cs. Mammalian CENP-Cs align throughout their lengths, as do the two <it>Saccharomyces </it>Mif2p proteins, but others align only at conserved regions. Portions of the human CENP-C protein implicated in centromere-targeting (purple bars) and DNA-binding (black bars) are shown at the top. The scale bar at the top marks the length of human CENP-C in amino acids. <b>(b) </b>Alignment of plant CENP-Cs. Within angiosperm families, proteins align throughout their lengths. Between families, weak conservation is found at the amino terminus and strong conservation at the carboxyl terminus. <b>(c) </b>Logos representation of an alignment of the CENPC motif from human; mouse; cow; chicken; <it>Caenorhabditis elegans</it>; budding yeast; <it>Schizosaccharomyces pombe</it>; <it>Physcomitrella patens</it>; maize <it>CenpcA</it>; rice; <it>A. thaliana</it>; black cottonwood, soybean, and tomato.</p>
					</text>
					<graphic file="jbiol11-3"/>
				</fig>
				<p>In human CENP-C, three regions have been reported to confer centromere targeting. One targeting signal was recently reported in region 283-429 <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>. A second targeting region was mapped by mutation to region 522-534, with arginine 522 crucial for localization <abbrgrp><abbr bid="B42">42</abbr></abbrgrp>. Targeting by the conserved carboxyl terminus (728-943) occurs for species as distant as <it>Xenopus </it><abbrgrp><abbr bid="B21">21</abbr><abbr bid="B41">41</abbr><abbr bid="B42">42</abbr><abbr bid="B43">43</abbr></abbrgrp>. A segment that includes both the first and second targeting regions (1-584) failed to confer targeteting to centromeres in hamster BHK cells, however <abbrgrp><abbr bid="B43">43</abbr></abbrgrp>. We find that these two targeting regions are within the region of positive selection in primates and overlap with three of the regions of positive selection in rodents. A correspondence between centromere targeting and adaptive evolution has been noted for <it>Drosophila </it>CenH3, where the adaptively evolving Loop 1 region has been shown to be necessary and sufficient for targeting when swapped between native and heterologous orthologs <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>. Therefore, the lack of centromeric targeting of a human CENP-C fragment containing the first and second targeting regions in the heterologous hamster system might be attributed to adaptive evolution of DNA-binding specificity in these regions.</p>
				<p>Targeting of native CENP-C proteins depends on other centromere proteins that vary according to species <abbrgrp><abbr bid="B45">45</abbr></abbrgrp>, but the dependence of CENP-Cs on CenH3s for targeting appears to be universal <abbrgrp><abbr bid="B24">24</abbr><abbr bid="B46">46</abbr><abbr bid="B47">47</abbr><abbr bid="B48">48</abbr><abbr bid="B49">49</abbr></abbrgrp>. This dependence suggests that CENP-C proteins contain a conserved CenH3-interacting region, for which the CENPC motif is the only obvious candidate. The first half of the CENPC motif is rich in arginines, whereas the second half has mixed chemical properties including three aromatic residues (Figure <figr fid="F3">3c</figr>). In the non-specific binding of nucleosome cores to DNA, 14 DNA contacts are made by arginines binding to the minor groove <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>. This suggests that the weak DNA binding of the carboxyl terminus of CENP-C may be mediated by the arginines of the CENPC motif, with the remainder of the motif contacting a conserved structural feature of centromeric nucleosomes.</p>
				<p>Not all regions of CENP-C that display positive selection correspond to regions that bind DNA <it>in vitro </it>or that are sufficient for targeting centromeres. For example, the region comprising the most amino-terminal 200 or so amino acids of rodent CENP-C has been evolving adaptively, but the orthologous region in human CENP-C fails to bind DNA in a southwestern assay <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B19">19</abbr></abbrgrp> or to localize to centromeres of human embryonic kidney cells <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>. This suggests that the amino-terminal region of CENP-C plays a supporting role in packaging centromeric chromatin. A parallel situation appears to hold for the adaptively evolving amino-terminal tail of <it>Drosophila </it>CenH3, which was found to be neither necessary nor sufficient for targeting <it>in vivo </it>to homologous centromeres. In this case, Loop 1 was identified as the targeting domain, and the amino-terminal tail was hypothesized to help stabilize higher-order chromatin structure by binding to linker DNA, similar to the known binding activity of canonical histone tails <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>. If CENP-C in mammals is subject to the same evolutionary forces that shape the adaptive evolution of the CenH3 tail in <it>Drosophila</it>, then CENP-C might be playing a comparable role in the stabilization of higher-order centromeric chromatin.</p>
				<p>Positive selection in the central DNA-binding and centromere-targeting region of CENP-C offers an explanation for the lack of conservation of this region between chicken and mammals <abbrgrp><abbr bid="B51">51</abbr></abbrgrp>: as positive selection acts on the amino acids that contact rapidly evolving centromeric satellites and that serve to target the protein to a specific but ever-changing substrate, it may eventually erase all recognizable homology in these protein regions.</p>
			</sec>
			<sec>
				<st>
					<p><it>Cenpc </it>gene structure and conservation in plants</p>
				</st>
				<p>Our finding that adaptive evolution is occurring in animal CENP-Cs encouraged a similar survey of plant CENP-Cs, because centromeres from both animals and seed plants comprise rapidly evolving satellite sequences. At the time we began this study, <it>Cenpc </it>genes in plants had been characterized only in maize (<it>Z. mays</it>), so we needed first to identify <it>Cenpc </it>homologs from other plants to ascertain whether or not the gene is evolving adaptively.</p>
				<p>Three <it>Cenpc </it>homologs have been described in maize: <it>CenpcA, CenpcB, </it>and <it>CenpcC </it><abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. Immunological localization of CENP-CA to maize centromeres indicates that it is probably functional, so plant relatives of maize CENP-CA should also represent CENP-Cs. We used the CENP-CA protein sequence (AAD39434) as a query in a tblastn search of GenBank, and identified a single <it>Cenpc </it>homolog (AC013453, At1g15660) in the genome of <it>Arabidopsis thaliana </it>by sequence similarity at both protein termini (Figure <figr fid="F4">4</figr>). Isolation and sequencing of a full-length <it>Cenpc </it>cDNA (Additional data file 2) revealed that the 705 amino-acid CENP-C protein of <it>Arabidopsis </it>is encoded in 11 exons, with the CENPC motif encoded in exon 10 (Figure <figr fid="F5">5</figr>). Recently, <it>Arabidopsis </it>CENP-C has been found to localize to <it>Arabidopsis </it>centromeres <abbrgrp><abbr bid="B52">52</abbr></abbrgrp>.</p>
				<fig id="F4">
					<title>
						<p>Figure 4</p>
					</title>
					<caption>
						<p>Alignment of conserved regions of angiosperm CENP-C predicted proteins</p>
					</caption>
					<text>
						<p>Alignment of conserved regions of angiosperm CENP-C predicted proteins. <b>(a) </b>Short regions of conservation are encoded in the first six exons of <it>Cenpc </it>genes from five families. The dipeptide SQ (underlined) is relatively frequent in exon 5. <b>(b) </b>Multiple alignment reveals strong conservation in the carboxyl termini of encoded proteins from six families. The CENPC motif is indicated. At, <it>A. thaliana</it>; Mt, barrel medic; Os, rice; Zm, maize CENP-CA; St, potato; SLe, tomato; Bv, beet; Pbt, black cottonwood.</p>
					</text>
					<graphic file="jbiol11-4"/>
				</fig>
				<fig id="F5">
					<title>
						<p>Figure 5</p>
					</title>
					<caption>
						<p>Gene models of selected plant <it>Cenpc </it>genes</p>
					</caption>
					<text>
						<p>Gene models of selected plant <it>Cenpc </it>genes. Exon/intron structure is conserved across families from exon 1 through the beginning of exon 6, and for the final two exons and introns. Exon sizes are given to the nearest codon where genomic sequence is available to confirm predicted exons. Duplicated exons are indicated by gray shading.</p>
					</text>
					<graphic file="jbiol11-5"/>
				</fig>
				<p>We searched the GenBank EST database, querying with the predicted protein sequences of maize CENP-CA and <it>Arabidopsis </it>CENP-C. We identified ESTs from putative plant <it>Cenpc </it>genes in 20 angiosperm species representing eight families and in the moss <it>Physcomitrella patens </it>(see Additional data file 1). We obtained the cDNA clones corresponding to 16 of these ESTs and sequenced them completely (see Additional data file 2). An alignment of the carboxyl termini encoded by cDNAs representing six angiosperm families revealed that the final 80 or so amino acids of CENP-C, including the CENPC motif, are highly conserved in plants (Figure <figr fid="F4">4b</figr>). For comparison, the carboxyl termini of vertebrate CENP-C proteins have approximately 180 amino acids following the CENPC motif (Figure <figr fid="F3">3a</figr>), including a block of 52 amino acids that is conserved in yeast Mif2p <abbrgrp><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr></abbrgrp>, but not in nematodes <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. The carboxyl termini of plant CENP-Cs do not show significant similarity to animal and fungal CENP-Cs except for the CENPC motif.</p>
				<p>As an aid in identifying other conserved regions of angiosperm CENP-Cs, we developed gene models for full-length <it>Cenpc </it>cDNAs by aligning them with available genomic sequences (Additional data file 1). A full-length cDNA from barrel medic (<it>Medicago truncatula</it>) encodes a protein of 697 amino acids, which corresponds to a gene model of eleven exons when aligned to a genomic pseudogene (Figure <figr fid="F5">5</figr>). We also predicted gene models for <it>Cenpc </it>genes in the grasses using cDNAs and genomic sequences from rice (<it>Oryza sativa</it>), maize, and sorghum (<it>Sorghum bicolor</it>) (Figure <figr fid="F5">5</figr>). The maize gene model of 14 exons suggests an explanation for the anomalous maize cDNA '<it>CenpcC</it>' (AF129859) <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>, which differs from all other plant <it>Cenpc</it>s in encoding an unrelated carboxyl terminus. <it>CenpcC </it>is 99.9% identical to maize <it>CenpcA </it>until it diverges downstream of the CENPC motif at the point corresponding to the end of exon 13 in our gene model. On the basis of an overlap with maize and <it>Sorghum </it>genomic sequence that spans the intron between exons 13 and 14, we conclude that the divergent 3' end of <it>CenpcC </it>derives from the unspliced intron 13 of <it>CenpcA</it>, and that all angiosperm CENP-Cs share a highly conserved carboxyl terminus.</p>
				<p>Comparing the gene models of <it>Arabidopsis</it>, barrel medic, maize, <it>Sorghum</it>, and rice, the limited conservation of the encoded amino-acid sequences and approximate correspondence of exon sizes suggest that the exons in the amino-terminal half and the final two exons of plant CENP-C are conserved (Figures <figr fid="F3">3</figr>,<figr fid="F5">5</figr>). The middle region does not show conservation of intron position or encoded peptide sequence, indicating rapid evolution within angiosperms. We assumed conservation of the first five intron positions in the 5' half of the coding sequence to generate an amino-terminal alignment that represents five families, including the protein encoded by a beet (<it>Beta vulgaris</it>) cDNA that appears to contain an unspliced intron. Our alignment reveals short regions of conservation throughout the amino terminus, as well as a high relative incidence of the dipeptide SQ in the poorly conserved exon 5 (Figure <figr fid="F4">4</figr>).</p>
				<p>Despite these short regions of conservation within angiosperms, no sequence similarity between plant and animal CENP-Cs could be detected outside of the CENPC motif. Nevertheless, plant and animal CENP-Cs appear to share an overall architecture (Figure <figr fid="F3">3</figr>). Both angiosperm and vertebrate CENP-Cs <abbrgrp><abbr bid="B16">16</abbr></abbrgrp> have regions of conservation at the amino and carboxyl termini, with little or no conservation in the middle region of the protein. Remarkably, plant and animal CENP-Cs also share the same modular exon organization for the CENPC motif, which lies within a 105-108 bp exon (encoding 35-36 amino acids) that is spliced in the same frame in both plants and animals (see Additional data file 3). Considering the similar overall lengths of plant and animal CENP-Cs, the arrangement of conserved regions, and the common location of the CENPC module, it appears that corresponding regions of the protein are evolving similarly and may serve similar functions.</p>
			</sec>
			<sec>
				<st>
					<p>Recurrent exon duplications in the grasses</p>
				</st>
				<p>Multiple alignment of plant <it>Cenpc</it>s revealed that one region of the gene is subject to duplication, but only in grasses. One part of the poorly conserved middle region of the gene has been repeatedly duplicated and deleted, thus encoding proteins of different sizes. In rice, an ancestral pair of exons, corresponding to exons 9 and 10 in maize <it>CenpcA</it>, has been triplicated in tandem (Figure <figr fid="F5">5</figr>). To facilitate comparison with maize and other grasses, we designated the rice exons as 9a-10a, 9b-10b, and 9c-10c. Exon 9c has an additional internal tandem duplication of its first 14 codons. Consensus sequences derived from overlapping truncated ESTs (Additional data file 1) and cDNAs (Additional data file 2) from the closely related species wheat (<it>Triticum aestivum</it>) and barley (<it>Hordeum vulgare</it>) indicate that there are two tandem copies of exons 9 and 10 in these species (designated 9p-10p and 9q-10q in Figure <figr fid="F5">5</figr>). We confirmed the sequence of these exons by designing primers and amplifying the corresponding regions from wheat and barley genomic DNAs. Single copies of exons 9 and 10 were found in full-length cDNAs from sugarcane, <it>Sorghum bicolor </it>and <it>Sorghum propinquum </it>(Table <tblr tid="T2">2</tblr>; Figure <figr fid="F5">5</figr>).</p>
				<tbl id="T2">
					<title>
						<p>Table 2</p>
					</title>
					<caption>
						<p>Regions of selection in pairwise comparisons of maize <it>CenpcA, Sorghum bicolor Cenpc</it>, and sugarcane <it>Cenpc1</it></p>
					</caption>
					<tblbdy cols="5">
						<r>
							<c ca="left">
								<p>Exons</p>
							</c>
							<c ca="center">
								<p>Direction of selection</p>
							</c>
							<c ca="center">
								<p>Maize <it>vs. Sorghum</it></p>
							</c>
							<c ca="center">
								<p>Maize <it>vs. </it>sugarcane</p>
							</c>
							<c ca="center">
								<p><it>Sorghum vs. </it>sugarcane</p>
							</c>
						</r>
						<r>
							<c cspan="5">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>1</p>
							</c>
							<c ca="center">
								<p>+</p>
							</c>
							<c ca="center">
								<p>12-44</p>
							</c>
							<c ca="center">
								<p>12-44</p>
							</c>
							<c ca="center">
								<p>1-42</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>+</p>
							</c>
							<c ca="center">
								<p>+ (0.17)</p>
							</c>
							<c ca="center">
								<p>+ (0.04)</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>1-5</p>
							</c>
							<c ca="center">
								<p>-</p>
							</c>
							<c ca="center">
								<p>34-165</p>
							</c>
							<c ca="center">
								<p>23-176</p>
							</c>
							<c ca="center">
								<p>87-163</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>-</p>
							</c>
							<c ca="center">
								<p>-</p>
							</c>
							<c ca="center">
								<p>-</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>4-6</p>
							</c>
							<c ca="center">
								<p>+</p>
							</c>
							<c ca="center">
								<p>155-253</p>
							</c>
							<c ca="center">
								<p>166-253</p>
							</c>
							<c ca="center">
								<p>153-317</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>+</p>
							</c>
							<c ca="center">
								<p>+</p>
							</c>
							<c ca="center">
								<p>+</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>6</p>
							</c>
							<c ca="center">
								<p>*</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>232-286</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>-</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>6</p>
							</c>
							<c ca="center">
								<p>-</p>
							</c>
							<c ca="center">
								<p>298-363</p>
							</c>
							<c ca="center">
								<p>298-363</p>
							</c>
							<c ca="center">
								<p>298-352</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>-</p>
							</c>
							<c ca="center">
								<p>-</p>
							</c>
							<c ca="center">
								<p>- (0.13)</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>6</p>
							</c>
							<c ca="center">
								<p>+</p>
							</c>
							<c ca="center">
								<p>353-409</p>
							</c>
							<c ca="center">
								<p>342-407</p>
							</c>
							<c ca="center">
								<p>397-431</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>+</p>
							</c>
							<c ca="center">
								<p>+</p>
							</c>
							<c ca="center">
								<p>+ (0.06)</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>6-12</p>
							</c>
							<c ca="center">
								<p>-</p>
							</c>
							<c ca="center">
								<p>410-621</p>
							</c>
							<c ca="center">
								<p>397-630</p>
							</c>
							<c ca="center">
								<p>432-579</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>-</p>
							</c>
							<c ca="center">
								<p>-</p>
							</c>
							<c ca="center">
								<p>-</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>12-14</p>
							</c>
							<c ca="center">
								<p>*</p>
							</c>
							<c ca="center">
								<p>611-687</p>
							</c>
							<c ca="center">
								<p>609-685</p>
							</c>
							<c ca="center">
								<p>591-700</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>+</p>
							</c>
							<c ca="center">
								<p>+</p>
							</c>
							<c ca="center">
								<p>-</p>
							</c>
						</r>
					</tblbdy>
					<tblfn>
						<p>+, K<sub>a </sub>&gt; K<sub>s</sub>; -, K<sub>a </sub>&lt; K<sub>s</sub>; <it>p </it>&#8804; 0.01 except where given in parentheses. * Direction of selection varies with lineage. Regions of selection are identified by codon positions based on the sequence of maize <it>CenpcA</it>. </p>
					</tblfn>
				</tbl>
				<p>Exon duplications were also found for <it>Sorghum </it>species but, surprisingly, these involved a different pair of exons, 11 and 12. One full-length cDNA from <it>S. bicolor </it>has only a single copy of exons 11 and 12, whereas a truncated pseudogene from <it>S. bicolor </it>and a full-length cDNA from <it>S. propinquum </it>are duplicated for exons 11 and 12 (designated 11a-12a and 11b-12b). The <it>S. bicolor </it>pseudogene has a deletion that joins sequences just upstream of the initiation codon in exon 1 to sequences upstream of exon 2. Despite the presence of tandemly duplicated exons, the <it>S. bicolor </it>truncated pseudogene is more closely related to the full-length <it>S. bicolor </it>gene than it is to the <it>S. propinquum </it>gene. Exons 11 and 12 in the <it>S. bicolor </it>full-length gene are identical to 11b-12b in the pseudogene, but have 7 differences from 11a-12a. This suggests that the duplication of exons 11 and 12 preceded the divergence of <it>S. propinquum </it>and <it>S. bicolor</it>, and that the full-length <it>S. bicolor </it>gene may have been derived by loss of exons 11a-12a from a full-length ancestral gene similar to the truncated pseudogene.</p>
				<p>We wondered why two different pairs of exons, 9-10 and 11-12, were each independently subject to duplication in the grasses. When we examined multiple alignments of the peptide sequences encoded by both exon pairs in Logos format, it became apparent that they resembled each other in length and composition (Figure <figr fid="F6">6a</figr>). Exons 9 and 11 both encode peptides of 25-28 residues that are rich in acidic amino acids, whereas exons 10 and 12 encode peptides of 30-38 residues that are rich in basic amino acids. We compared alignments of exons 9 and 11 and alignments of exons 10 and 12 using the Local Alignment of Multiple Alignments (LAMA) program, and found that these exon pairs appear to be homologous (<it>E </it>&lt; 0.0001 for both comparisons). We conclude that exon pairs 9-10 and 11-12 derive from a more ancient duplication event.</p>
				<fig id="F6">
					<title>
						<p>Figure 6</p>
					</title>
					<caption>
						<p>CENP-C exon repeats in the grasses</p>
					</caption>
					<text>
						<p>CENP-C exon repeats in the grasses. <b>(a) </b>Alignments of copies of the duplicated exons 9, 10, 11, and 12 from the grass species in this study, excluding pseudogenes, are shown in Logos format. <b>(b) </b>A neighbor-joining phylogram (with gaps excluded) of the exon pairs 9-10 and 11-12 in grass species. A parsimony tree gave essentially the same topology. Dots indicate the locations of inferred duplication events in the tree. Presumed pseudogenes are marked with &#968;. <b>(c) </b>Schematic representation of exon duplication events leading to various <it>Cenpc </it>gene structures, and examples of grass species with these structures. Pairs of arrows indicate duplication events; lines terminating in a filled circle indicate loss of an exon pair in derivatives.</p>
					</text>
					<graphic file="jbiol11-6"/>
				</fig>
				<p>To trace the likely ancestry of these duplication events, we used an alignment of the exons from multiple species to construct phylogenetic trees of duplicates of exons 9-10 and 11-12 (Figure <figr fid="F6">6b</figr>). This phylogeny suggests that there have been numerous duplication events in the history of the grasses (Figure <figr fid="F6">6c</figr> and data not shown): first, a duplication generating exons 9-10 and 11-12 in an ancestor of the grasses; second, a duplication generating exons 9p-10p and 9q-10q; third, a duplication generating exons 11a-12a and 11b-12b in the <it>Sorghum </it>lineage; fourth, two duplications generating rice exons 9a-10a, 9b-10b, and 9c-10c all within the rice 9q-10q lineage; and fifth, a partial duplication in rice exon 9c.</p>
				<p>There also appear to have been at least three losses of duplications: one of exons 11a-12a in the lineage leading to the full-length <it>S. bicolor </it>gene, one of exons 11b-12b in the sugarcane genes, and one of the hypothetical rice 9p-10p. Alternatively, it is possible that the latter loss and one of the rice-specific duplications resulted from gene conversion of rice 9p-10p by a derivative of rice 9q-10q. Regardless of the exact number of duplication and deletion events, it is clear that the exon pair ancestral to grass exons 9-10 and 11-12 has been subjected to repeated episodes of duplication and deletion.</p>
			</sec>
			<sec>
				<st>
					<p>Plant CENP-Cs are adaptively evolving</p>
				</st>
				<p>The delineation of gene models for plant <it>Cenpc</it>s allowed us to analyze them for evidence of adaptive evolution. First, we compared <it>Cenpc</it>s from <it>Arabidopsis </it>species in which we had previously found adaptively evolving CenH3s. Using the <it>A. thaliana </it>genomic sequence to design primers, we amplified, cloned, and sequenced a <it>Cenpc </it>cDNA from <it>A. arenosa </it>(Additional data file 2). Comparing this sequence with that of <it>A. thaliana, </it>the predicted proteins differ by 87 amino-acid subtitutions out of 703 alignable residues, plus five indels of 1-3 amino acids.</p>
				<p>We applied the sliding window option of K-estimator to the aligned coding sequences of <it>A. thaliana </it>and <it>A. arenosa Cenpc</it>. At three regions, K<sub>a </sub>exceeded its 99% confidence interval for the null hypothesis, indicating that these regions are under positive selection (Figures <figr fid="F2">2b</figr>,<figr fid="F3">3</figr>). These regions correspond approximately to exon 5 (codons 178-221 in the <it>A. thaliana </it>sequence), the 3' half of exon 6 (codons 376-441), and exons 8 and 9 (codons 486-618). In addition, a region encompassing most of exons 1 and 2 (codons 24-89) was found to be under positive selection with <it>p </it>&lt; 0.03. We also determined that the 5' half of exon 6 (codons 255-386) and the conserved exons 10 and 11 (codons 595-703) are under negative selection with <it>p </it>&lt; 0.01.</p>
				<p>Curiously, an indel at the beginning of exon 9, where the <it>A. arenosa </it>cDNA has a CAG (glutamine) codon that is absent in the <it>A. thaliana </it>cDNA, appears to be caused by the species-specific use of alternative acceptor splice sites, because the genomic sequence (data not shown) at this intron-exon boundary is identical in both species (...cag cag ^GAG GGT... or ...cag ^CAG GAG GGT...). The presence of species-specific alternative splicing of the same codon in an adaptively evolving region suggests that splicing variation can contribute to adaptive variability.</p>
				<p>To examine whether positive selection in <it>Cenpc </it>is unique to <it>Arabidopsis </it>or occurs more generally in plants, we compared <it>Cenpc </it>genes from the two <it>Sorghum </it>species. We removed the duplicate exons 11a and 12a from the <it>S. propinquum </it>coding sequence in order to compare the sequence with the full-length gene from <it>S. bicolor </it>. For this comparison, K<sub>a </sub>= 0.014 and K<sub>s </sub>= 0.003. K<sub>a </sub>exceeds the 99% confidence interval of the null hypothesis, and neutral evolution can be rejected in favor of positive selection. The limited divergence between these two sequences did not allow statistically significant conclusions to be drawn about positive selection in particular regions of the gene.</p>
				<p>To address which regions are under positive selection, we compared the <it>S. bicolor </it>sequence with the maize <it>CenpcA </it>coding sequence (75% amino-acid identity). Between maize <it>CenpcA </it>and <it>S. bicolor, </it>K<sub>a </sub>= 0.12, K<sub>s </sub>= 0.14, and there are seven indels of 1-11 codons. We identified positive selection for a single window in exon 1, for a region including all of exon 5, for a region in the second half of exon 6, and for a region from the end of exon 12 through most of exon 14 (Table <tblr tid="T2">2</tblr> and Figure <figr fid="F2">2c</figr>). Negative selection was found for a region from exons 1-4, a region in the middle one-third of exon 6, and a region from the end of exon 6 through exon 12 (Table <tblr tid="T2">2</tblr> and Figure <figr fid="F2">2c</figr>). The regions of positive selection seen in exons 1 and 5 clearly overlap the corresponding regions in <it>Arabidopsis </it>(Figure <figr fid="F3">3b</figr>). Although the region of positive selection seen in exon 6 of the grasses cannot be aligned with that in exon 6 of <it>Arabidopsis </it>because of sequence divergence, they occur in the same general area of the protein.</p>
				<p>The region of positive selection in exons 12-14 was somewhat surprising given the strong conservation around the CENPC motif, and we wondered if this selection was specific to the maize or <it>Sorghum </it>lineage. To test this possibility, we compared maize <it>CenpcA </it>and <it>S. bicolor Cenpc </it>with a <it>Cenpc </it>gene from sugarcane. Of three sugarcane cDNAs that we obtained (Additional data file 2), two had identical coding sequences (<it>Cenpc1</it>), and the third (<it>Cenpc2</it>) differed by 13 nucleotide substitutions, suggesting that <it>Cenpc1 </it>and <it>Cenpc2 </it>may be homeologous genes in the polyploid sugarcane genome. We compared <it>Cenpc1 </it>to maize <it>CenpcA </it>and <it>S. bicolor Cenpc</it>. Regions of positive or negative selection identified in the maize/<it>Sorghum </it>comparison were generally found to coincide with regions under selection in the corresponding direction in maize/sugarcane and <it>Sorghum</it>/sugarcane comparisons, although in a few cases the selection was not found at the <it>p </it>&lt; 0.05 level of significance in all comparisons (Table <tblr tid="T2">2</tblr>). We conclude that these regions are subject to recurrent adaptive evolution.</p>
				<p>In two cases, a region was found to be under significant selection in opposite directions in different comparisons. First, a region in exon 6 (maize codons 232-286) that was not under significant selection in the maize/<it>S. bicolor </it>comparison was under negative selection in the maize/sugarcane comparison, but under positive selection in the <it>S. bicolor</it>/sugarcane comparison; this suggests that positive selection in <it>S. bicolor </it>and negative selection in maize combined to give a non-significant result in the maize/<it>S. bicolor </it>comparison. Second, the region of positive selection in exons 12-14 identified from the maize/<it>Sorghum </it>comparison was under positive selection in the maize/sugarcane comparison, but under negative selection in the <it>Sorghum</it>/sugarcane comparison, indicating that the positive selection in this region is unique to the maize <it>CenpcA </it>lineage (Table <tblr tid="T2">2</tblr>, Figure <figr fid="F3">3b</figr>). Therefore, in some regions of CENP-C adaptive evolution appears to be episodic, as has been seen previously for primate lysozymes <abbrgrp><abbr bid="B53">53</abbr></abbrgrp>.</p>
				<p>Phylogeny-based PAML analysis confirms that plant CENP-Cs are adaptively evolving and in an episodic fashion, consistent with inferences based on pairwise K-estimator analysis. As was found for mammalian CENP-C, the data-driven model for grass CENP-C was found to be significantly more probable than the null model (&#967;<sup>2 </sup>= 12.0; <it>p </it>= 0.003) with K<sub>a</sub>/K<sub>s </sub>= 0.00 for 51% of the 686 sites in the multiple alignment and K<sub>a</sub>/K<sub>s </sub>= 2.00 for 49% of the sites (data not shown). Using the PAML 'free-ratio' option that measures K<sub>a</sub>/K<sub>s </sub>differences between branches in a tree <abbrgrp><abbr bid="B54">54</abbr></abbrgrp>, we found that CENP-C is adaptively evolving (&#967;<sup>2</sup>= 10.0; <it>p </it>= 0.007 for the data-driven over the null model) along both <it>Sorghum </it>lineages (K<sub>a</sub>/K<sub>s </sub>= 2.6) and along the sugarcane <it>Cenpc2 </it>lineage (K<sub>a</sub>/K<sub>s </sub>= 1.3) but not detectably along the sugarcane <it>Cenpc1 </it>lineage (K<sub>a</sub>/K<sub>s </sub>= 0.23). PAML analysis also confirmed that the carboxy-terminal region of maize <it>CenpcA </it>is adaptively evolving (&#967;<sup>2 </sup>= 7.8; <it>p </it>= 0.02 for the data-driven model over the null model) with K<sub>a</sub>/K<sub>s </sub>= 1.4. Thus, different methods of analysis demonstrate that CENP-Cs are adaptively evolving in an episodic fashion in grasses that have multiple CENP-C copies.</p>
				<p>Maize <it>CenpcA </it>is co-expressed with another <it>Cenpc </it>gene, <it>CenpcB </it><abbrgrp><abbr bid="B25">25</abbr></abbrgrp>, for which incomplete sequence information is available (AF129858, AW062057, and AY109432). The available <it>CenpcB </it>sequence begins in exon 5 and continues through exon 14, and has seven in-frame indels relative to <it>CenpcA</it>. We found negative selection in a region from the end of exon 5 through the first half of exon 6 (<it>CenpcA </it>codons 205-358, <it>p </it>&lt; 0.01), and positive selection from the end of exon 6 through exon 7 (codons 403-492, <it>p </it>&lt; 0.01). Elsewhere the null hypothesis could not be rejected. Comparing <it>CenpcB </it>with <it>S. bicolor</it>, K<sub>a </sub>= 0.13 and K<sub>s </sub>= 0.11, and there are six in-frame indels between the sequences. We found positive selection in the first half of exon 6 (<it>Sorghum </it>codons 273-327, <it>p </it>&lt; 0.03) and negative selection from exon 10 through the first few codons of exon 13 (codons 537-619, <it>p </it>&lt; 0.02). Elsewhere the null hypothesis could not be rejected. In summary, regions under negative selection in other grass <it>Cenpc</it>s can be under positive selection in <it>CenpcB</it>, and regions under positive selection in other grass <it>Cenpc</it>s are under negative selection or evolving neutrally in <it>CenpcB </it>(Figure <figr fid="F3">3b</figr>), suggesting that CENP-CB has been subjected to different selective forces since its divergence from CENP-CA.</p>
				<p>Just as gene duplication can result in different selective pressures on the two genes, duplications within a gene can lead to specialization and thus can change selective pressures on the region. Such specialization appears to have occurred between the anciently duplicated region encoded by exons 9-10 and 11-12 (Figure <figr fid="F6">6a</figr>). In maize, sugarcane, and <it>Sorghum </it>we detected negative selection for exons 9-12, but in the more recent duplication of exons 9 and 10 in wheat and barley we detected positive selection in a region from the last codon of the first copy of exon 10 to the first four codons of exon 12 (<it>p </it>&lt; 0.01). Additional windows in exons 9p-10p and 12 had K<sub>a </sub><it>&gt;</it>K<sub>s </sub>(<it>p </it>&gt; 0.05), suggesting that most of the duplicated region has been evolving adaptively (Figure <figr fid="F2">2d</figr>). In contrast, in the adjacent conserved carboxy-terminal region (corresponding to <it>CenpcA </it>codons 625-690), we detected only negative selection (<it>p </it>= 0.01), as though exon duplication allowed for adaptation.</p>
				<p>We find an approximate correspondence between adaptively evolving regions in angiosperms and those in mammals that overlap DNA-binding and centromere-targeting regions. Although no DNA-binding regions have been experimentally determined for plant CENP-Cs, the correspondence with animal CENP-Cs suggests that the different regions play comparable roles. One of the corresponding adaptive regions is repeatedly duplicated in grasses, and the distribution of basic residues in exons 10 and 12 suggests that the repeat unit binds DNA. A parallel situation again appears to be found for the amino-terminal tails of some <it>Drosophila </it>CenH3s, which contain repeats of a minor groove-binding motif that are thought to provide DNA compositional preference <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. Thus, both plant and animal CENP-Cs show adaptively evolving features that parallel those found in CenH3s.</p>
			</sec>
			<sec>
				<st>
					<p>Yeast <it>MIF2 </it>is under negative selection</p>
				</st>
				<p>If positive selection for CENP-Cs in plants and animals is related to centromere complexity, then we would expect conventional negative selection to operate in organisms such as budding yeast, which have simple centromeres. The <it>MIF2 </it>genes of <it>Saccharomyces cerevisia</it>e <abbrgrp><abbr bid="B26">26</abbr></abbrgrp> and <it>S. paradoxu</it>s <abbrgrp><abbr bid="B55">55</abbr></abbrgrp> are 93% identical in amino-acid sequence, with K<sub>a </sub>= 0.036 and K<sub>s </sub>= 0.38. In sharp contrast to all pairwise comparisons of plant and animal <it>Cenpc </it>genes, K<sub>a </sub>was much less than K<sub>s </sub>for all of the 99 bp windows of yeast <it>MIF2</it>, indicating that it is under negative selection throughout its length (<it>p </it>&lt; 0.001). In all pairwise comparisons among these two species and the additional species <it>S. mikatae </it>and <it>S. bayanus </it><abbrgrp><abbr bid="B55">55</abbr></abbrgrp>, we consistently found evidence of negative selection with K<sub>a </sub>&lt;&lt; K<sub>s </sub>(range of K<sub>a</sub>, 0.036-0.093; range of K<sub>s</sub>, 0.38-0.82). We also found strong negative selection for all 99 bp windows in pairwise comparisons of yeast CenH3 (Cse4p; data not shown). Thus, adaptive evolution of both CenH3s and CENP-Cs appears to be limited to organisms with complex centromeres.</p>
			</sec>
			<sec>
				<st>
					<p>Meiotic drive model of centromere evolution</p>
				</st>
				<p>We have demonstrated that CENP-C has been adaptively evolving in multiple lineages of both plants and animals, a feature that had been previously shown for some CenH3s. Thus, the occurrence of adaptive evolution appears to be a general feature of proteins that bind to complex centromeres. Recurrent adaptive evolution implies an arms race, and an arms race involving centromeric DNA-binding proteins is remarkable given that centromeres have a conserved function. But centromeric DNA is rapidly evolving in plants and animals, so adaptation of the major centromere DNA-binding proteins would maintain an interface with the conserved kinetochore machinery. Indeed, regions of CENP-C that show evidence of positive selection include DNA-binding and specificity regions, in parallel with previous findings for <it>Drosophila </it>and <it>Arabidopsis </it>CenH3 <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B11">11</abbr><abbr bid="B44">44</abbr></abbrgrp>.</p>
				<p>A 'meiotic drive' model has been proposed to explain the rapid evolution of centromeric DNAs and CenH3s <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. According to this model, centromeres compete during female meiosis for inclusion in the single meiotic product that becomes the egg nucleus and so gets transmitted to the next generation. In both animals and seed plants, which of the four meiotic products becomes the egg nucleus is determined by its position in the female tetrad. A centromere variant will increase in the population if it achieves an orientation in female meiosis resulting in its inclusion in the egg nucleus more frequently than its competitors. For example, an expansion of a satellite array may lead to a 'stronger' centromere variant with an expanded kinetochore that attracts more microtubules and results in a slightly greater probability of a favorable orientation in female meiosis. The mechanism of such orientation is unknown, but in some insects and plants the female meiotic spindle has an asymmetric distribution of microtubules or is monopolar <abbrgrp><abbr bid="B56">56</abbr></abbrgrp>, so a stronger centromere variant might better capture the favored pole. The new variant will therefore increase in the population and eventually become fixed. This meiotic drive process ('centromere drive') can account for the rapid evolution and complex structure of centromeric DNA. As a rare new variant spreads in the population, however, disparities in centromere strength may interfere with fertility in males, where the four meiotic products contribute equally to the next generation. Mutations in CenH3 that restore centromere parity in meiosis will therefore be selected in males, resulting in the adaptive evolution of CenH3 and suppression of the meiotic drive of centromeric DNA. Recurrent cycles of meiotic drive by centromere variants, or centromere drive, and suppression by CenH3 mutations would result in the observed rapid evolution of both centromeres and CenH3s.</p>
				<p>The lack of evidence for adaptive evolution in CenH3s from mammals and grasses does not seem to fit this scenario. But the extensive positive selection on the corresponding CENP-Cs provides a ready explanation for the absence of an adaptive signal for CenH3. The meiotic drive model predicts that over evolutionary time any mutation that restores centromere parity will be selected, suggesting that proteins besides CenH3 - and in particular other kinetochore proteins that contact centromeric DNA - may be positively selected to suppress centromere drive. Our demonstration of the adaptive evolution of CENP-C, especially in DNA-binding regions, fulfills this prediction of the centromere drive model. Apparently, in mammals and grasses CENP-C performs the function of a suppressor of meiotic drive.</p>
				<p>The large size and lack of sequence conservation of CENP-Cs make them much larger mutational targets for suppression than CenH3s. Moreover, PAML analysis suggests that a larger proportion of CENP-C than CenH3 residues are evolving adaptively. Mammalian CENP-A consists of a well-conserved histone-fold domain with only a short unconstrained tail region. Conversely, <it>Drosophila </it>species have the longest CenH3 tails known <abbrgrp><abbr bid="B12">12</abbr></abbrgrp> but lack any identifiable CENP-C homologs. It is tempting to speculate that the interaction of the long CenH3 tail of <it>Drosophila </it>with centromeric satellites compensates for the absence of CENP-C and permitted its loss. This might explain why <it>Drosophila </it>CenH3s localize in a species-specific manner <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>, whereas human CENP-A can be functionally replaced by its budding yeast CenH3 counterpart <abbrgrp><abbr bid="B57">57</abbr></abbrgrp>.</p>
				<p>Centromere drive may have important consequences for karyotypic evolution. Centromeres of two acrocentric chromosomes frequently fuse (Robertsonian translocations), and metacentrics often misdivide to yield two acrocentrics. In humans, there is a bias in favor of Robertsonian translocations over their homologous acrocentric pair when transmitted by females, and male carriers have reduced fertility <abbrgrp><abbr bid="B58">58</abbr></abbrgrp>. This general sterility of Robertsonian males is consistent with centromere drive underlying post-zygotic reproductive isolation in emerging species <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. Centromere drive provides a mechanism for the tendency of karyotypes to be either mostly metacentric or mostly acrocentric <abbrgrp><abbr bid="B59">59</abbr></abbrgrp> and for the karyotype-specific accumulation of selfish B chromosomes in mammals <abbrgrp><abbr bid="B60">60</abbr></abbrgrp>. Our finding that CENP-Cs, like CenH3s, evolve adaptively addresses a perceived shortcoming of the centromere drive model for post-zygotic reproductive isolation: mutations that rescued hybrid sterility did not map to the <it>Drosophila </it>CenH3 gene <abbrgrp><abbr bid="B61">61</abbr><abbr bid="B62">62</abbr></abbrgrp>. The fact that CenH3 is not the only adaptively evolving centromere protein indicates that there are multiple candidate drive suppressors that might rescue hybrid sterility when in a mutant form.</p>
				<p>In contrast to CENP-Cs of plants and animals, yeast Mif2p appears to have evolved entirely under negative selection. This is consistent with Mif2p interacting with a stable centromere, rather than one that is rapidly evolving. In accordance with this observation, budding yeast centromeres are determined by the presence of a consensus DNA sequence that includes binding sites for the Cbf1 and CBF3 proteins <abbrgrp><abbr bid="B49">49</abbr></abbrgrp>. The consensus DNA sequences and their binding proteins are recognizably similar in yeasts as distantly related as <it>Candida glabrata </it>and <it>Kluyveromyces lactis</it>, which have greater average divergence from budding yeast in protein sequences than mammals have from fish <abbrgrp><abbr bid="B63">63</abbr></abbrgrp>. We attribute this extreme conservation of centromere sequence to optimization of the DNA-protein interactions at the centromere. Such optimization would be inevitable in fungi that produce equivalent gametes in a tetrad. No such optimization would occur when centromeres compete at female meiosis I for a favored orientation. Seed plants and animals evolved female meiosis independently, so the parallels that we see for evolution of CenH3 and CENP-C would reflect parallel evolutionary forces in these two ancient lineages.</p>
			</sec>
		</sec>
		<sec>
			<st>
				<p>Materials and methods</p>
			</st>
			<sec>
				<st>
					<p>DNA clones and sequencing</p>
				</st>
				<p>Genomic DNAs and cDNAs were obtained from several sources (Additional data file 2). The <it>A. thaliana </it>cDNA was amplified from a cDNA pool from whole plants of the ecotype Columbia, and the <it>A. arenosa </it>cDNA was amplified from a cDNA pool from leaves of Care-1 <abbrgrp><abbr bid="B64">64</abbr></abbrgrp>. Both of these cDNAs were amplified using the same primers: 5'-GGAATTTTCCGGTGATTTAGATG-3', which terminates in the initiation codon, and 5'-TGATCACAAGAGGATGGTTGA-3', from the 3' untranslated region of the <it>A. thaliana </it>genomic <it>Cenpc </it>sequence. Genomic DNAs from wheat and barley were generously provided by Andreas Houben. Exons 9p-10p and the intervening intron 9p were amplified from both wheat and barley using the primers 5'-AGATGAACCAATCCATCCAC-3' and 5'-AAATTCGTTTTCCTCTCTTTGCT-3'. Likewise, 9q-10q and intron 9q were amplified with the primers 5'-AGATAAGCCAATCCATACATCA-3' and 5'-CCCCTCTTTTCATTCTCTTCAA-3'. The first and last of these four primers were also used to amplify both exon pairs as a unit to confirm their contiguity and to determine the genomic sequence around the junctions of intron 10p with exons 10p and 9q. Amplifications used High Fidelity Platinum <it>Taq </it>polymerase (Invitrogen, Carlsbad, USA). The amplified fragments were cloned using the pCR2.1-TOPO-TA cloning kit (Invitrogen) according to the manufacturer's instructions. Sequencing was carried out using ABI Big Dye sequencing on both strands of all reported sequences. Sequencing primers were standard vector primers or were designed using Primer 3 <abbrgrp><abbr bid="B65">65</abbr></abbrgrp>. Sequences were assembled using Sequencher 4.1.2 software <abbrgrp><abbr bid="B66">66</abbr></abbrgrp>. Accession numbers of sequences are given in Additional data file 2.</p>
			</sec>
			<sec>
				<st>
					<p>Sequence analyses</p>
				</st>
				<p>Sequence similarities of genes and their encoded proteins were identified using the NCBI BLAST server <abbrgrp><abbr bid="B35">35</abbr><abbr bid="B67">67</abbr></abbrgrp>, as well as by use of Gramene <abbrgrp><abbr bid="B68">68</abbr></abbrgrp> and the TIGR Gene Indices <abbrgrp><abbr bid="B69">69</abbr></abbrgrp>. Translations and sequence manipulations utilized the Sequence Manipulation Suite <abbrgrp><abbr bid="B70">70</abbr><abbr bid="B71">71</abbr></abbrgrp>. Alignments of coding and amino-acid sequences were performed using the European Bioinformatics Institute Clustal W Server <abbrgrp><abbr bid="B72">72</abbr><abbr bid="B73">73</abbr></abbrgrp>, with adjustments by hand to take account of splice-site alignment. Conservation in alignments was displayed using MacBoxShade 2.1 (MD Baron, Institute for Animal Health, Surrey, UK). Protein blocks were made, displayed, and compared using the Multiple Alignment Processor, sequence Logos, and LAMA <abbrgrp><abbr bid="B74">74</abbr></abbrgrp> programs on the Blocks WWW Server <abbrgrp><abbr bid="B75">75</abbr></abbrgrp>. To make blocks from grass exons 9-12, gaps in ClustalW alignments were first filled with Xs, which do not appear in subsequent sequence Logos representations. Gene models of exon-intron boundaries were made by alignment of cDNAs with identical or homologous genomic sequences, as well as by splice-site prediction using the NetGene2 server <abbrgrp><abbr bid="B76">76</abbr><abbr bid="B77">77</abbr></abbrgrp>.</p>
				<p>K-estimator <abbrgrp><abbr bid="B31">31</abbr></abbrgrp> was used to estimate K<sub>a </sub>and K<sub>s </sub>in comparisons of <it>Cenpa/CENH3 </it>or <it>Cenpc </it>genes from pairs of closely related species. Prior to analysis, gaps were removed from the coding sequences as indicated by the amino-acid alignments.</p>
				<p>We estimated K<sub>a </sub>and K<sub>s </sub>for windows of 99 nucleotides, positioned every 33 nucleotides. For candidate regions of positive selection, we determined the confidence intervals of K<sub>a </sub>and K<sub>s </sub>under the null hypothesis that K<sub>a </sub>is equal to K<sub>s </sub>using the default parameters (1,000 replicates). For individual windows of 99 nucleotides, or for regions defined by contiguous groups of overlapping windows, limited trial and error suggested that statistically significant positive selection was not supported if K<sub>a</sub>/K<sub>s </sub>&lt; 1.5. Therefore, to find evidence of positive selection, we determined the confidence intervals for regions defined by sets of overlapping or immediately adjacent 99 nucleotide windows with K<sub>a</sub>/K<sub>s</sub> &#8805; 1.5. For regions with K<sub>s </sub>= 0, one or more flanking windows with K<sub>s </sub>&gt; 0 were included in the region analyzed, regardless of the value of K<sub>a</sub>, so that a value for K<sub>a</sub>/K<sub>s </sub>could be defined. Similarly, we looked for statistically significant negative selection for regions defined by overlapping or adjacent 99 nucleotide windows with K<sub>a</sub>/K<sub>s </sub>&#8804; 0.67. The codeml program of PAML version 3.13d <abbrgrp><abbr bid="B40">40</abbr></abbrgrp> was also used to test for positive selection and to estimate K<sub>a</sub>/K<sub>s </sub>ratios as previously described <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>.</p>
			</sec>
		</sec>
		<sec>
			<st>
				<p>Additional data files</p>
			</st>
			<p>The following are provided as additional data files. Additional data file <supplr sid="s1">1</supplr>, containing Table S1, reports accession numbers for selected <it>Cenpc </it>ESTs and genomic sequences from GenBank. Additional data file <supplr sid="s2">2</supplr>, containing Table S2, reports accession numbers for <it>Cenpc </it>cDNAs and amplified genomic sequences. Additional data file <supplr sid="s3">3</supplr>, containing Figure S1, displays the conservation of the exon containing the CENPC motif.</p>
			<suppl id="s1">
				<title>
					<p>Additional data file 1</p>
				</title>
				<caption>
					<p>Table S1 reports accession numbers for selected <it>Cenpc </it>ESTs and genomic sequences from GenBank</p>
				</caption>
				<text>
					<p>Table S1 reports accession numbers for selected <it>Cenpc </it>ESTs and genomic sequences from GenBank</p>
				</text>
				<file name="jbiol11-s1.pdf">
					<p>Click here for additional data file</p>
				</file>
			</suppl>
			<suppl id="s2">
				<title>
					<p>Additional data file 2</p>
				</title>
				<caption>
					<p>Table S2 reports accession numbers for <it>Cenpc </it>cDNAs and amplified genomic sequences</p>
				</caption>
				<text>
					<p>Table S2 reports accession numbers for <it>Cenpc </it>cDNAs and amplified genomic sequences</p>
				</text>
				<file name="jbiol11-s2.pdf">
					<p>Click here for additional data file</p>
				</file>
			</suppl>
			<suppl id="s3">
				<title>
					<p>Additional data file 3</p>
				</title>
				<caption>
					<p>Figure S1 displays the conservation of the exon containing the CENPC motif</p>
				</caption>
				<text>
					<p>Figure S1 displays the conservation of the exon containing the CENPC motif</p>
				</text>
				<file name="jbiol11-s3.pdf">
					<p>Click here for additional data file</p>
				</file>
			</suppl>
		</sec>
	</bdy>
	<bm>
		<ack>
			<sec>
				<st>
					<p>Acknowledgements</p>
				</st>
				<p>We thank Harmit Malik, Jennifer Cooper, and Caro-Beth Stewart for helpful discussions and Jorja Henikoff for help with LAMA. We also thank the American Type Culture Collection, the Arizona Genomics Institute, Clemson University Genomics Institute, Marie-Michele Cordonnier-Pratt and Lee Pratt, Ze-Guang Han, the Institute of Chemistry of the University of S&#227;o Paolo, the Japanese Rice Program of the National Institute of Agrobiological Sciences, the Samuel Roberts Noble Foundation, Benildo G. de los Reyes, Bento Soares, the United States Department of Agriculture, Bernd Weisshaar, and Ian Wilson for supplying cDNAs, and Andreas Houben for supplying wheat and barley genomic DNAs.</p>
			</sec>
		</ack>
		<refgrp>
			<bibl id="B1">
				<title>
					<p>The centromere paradox: stable inheritance with rapidly evolving DNA</p>
				</title>
				<aug>
					<au>
						<snm>Henikoff</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Ahmad</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Malik</snm>
						<fnm>HS</fnm>
					</au>
				</aug>
				<source>Science</source>
				<pubdate>2001</pubdate>
				<volume>293</volume>
				<fpage>1098</fpage>
				<lpage>1102</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1126/science.1062939</pubid>
						<pubid idtype="pmpid" link="fulltext">11498581</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B2">
				<title>
					<p>Genomic and genetic definition of a functional human centromere</p>
				</title>
				<aug>
					<au>
						<snm>Schueler</snm>
						<fnm>MG</fnm>
					</au>
					<au>
						<snm>Higgins</snm>
						<fnm>AW</fnm>
					</au>
					<au>
						<snm>Rudd</snm>
						<fnm>MK</fnm>
					</au>
					<au>
						<snm>Gustashaw</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Willard</snm>
						<fnm>HF</fnm>
					</au>
				</aug>
				<source>Science</source>
				<pubdate>2001</pubdate>
				<volume>294</volume>
				<fpage>109</fpage>
				<lpage>115</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmpid" link="fulltext">11588252</pubid>
						<pubid idtype="doi">10.1126/science.1065042</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B3">
				<title>
					<p>Evolution of satellite DNA sequences in <it>Drosophila</it></p>
				</title>
				<aug>
					<au>
						<snm>Lohe</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Roberts</snm>
						<fnm>P</fnm>
					</au>
				</aug>
				<source>In Heterochromatin, Molecular and Structural Aspects</source>
				<publisher>Cambridge: Cambridge University Press</publisher>
				<editor>Verma RS</editor>
				<pubdate>1988</pubdate>
				<fpage>148</fpage>
				<lpage>186</lpage>
			</bibl>
			<bibl id="B4">
				<title>
					<p>Chromosome-specific alpha-satellite DNA from the centromere of chimpanzee chromosome 4</p>
				</title>
				<aug>
					<au>
						<snm>Haaf</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Willard</snm>
						<fnm>HF</fnm>
					</au>
				</aug>
				<source>Chromosoma</source>
				<pubdate>1997</pubdate>
				<volume>106</volume>
				<fpage>226</fpage>
				<lpage>232</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1007/s004120050243</pubid>
						<pubid idtype="pmpid" link="fulltext">9254724</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B5">
				<title>
					<p>Tandemly repeated DNA sequences and centromeric chromosomal regions of <it>Arabidopsis </it>species</p>
				</title>
				<aug>
					<au>
						<snm>Heslop-Harrison</snm>
						<fnm>JS</fnm>
					</au>
					<au>
						<snm>Brandes</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Schwarzacher</snm>
						<fnm>T</fnm>
					</au>
				</aug>
				<source>Chromosome Res</source>
				<pubdate>2003</pubdate>
				<volume>11</volume>
				<fpage>241</fpage>
				<lpage>253</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1023/A:1022998709969</pubid>
						<pubid idtype="pmpid">12769291</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B6">
				<title>
					<p>Domain organization at the centromere and neocentromere</p>
				</title>
				<aug>
					<au>
						<snm>Choo</snm>
						<fnm>KH</fnm>
					</au>
				</aug>
				<source>Dev Cell</source>
				<pubdate>2001</pubdate>
				<volume>1</volume>
				<fpage>165</fpage>
				<lpage>177</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/S1534-5807(01)00028-4</pubid>
						<pubid idtype="pmpid">11702777</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B7">
				<title>
					<p>A 17-kD centromere protein (CENP-A) copurifies with nucleosome core particles and with histones</p>
				</title>
				<aug>
					<au>
						<snm>Palmer</snm>
						<fnm>DK</fnm>
					</au>
					<au>
						<snm>O'Day</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Wener</snm>
						<fnm>MH</fnm>
					</au>
					<au>
						<snm>Andrews</snm>
						<fnm>BS</fnm>
					</au>
					<au>
						<snm>Margolis</snm>
						<fnm>RL</fnm>
					</au>
				</aug>
				<source>J Cell Biol</source>
				<pubdate>1987</pubdate>
				<volume>104</volume>
				<fpage>805</fpage>
				<lpage>815</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1083/jcb.104.4.805</pubid>
						<pubid idtype="pmpid" link="fulltext">3558482</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B8">
				<title>
					<p>Human centromere protein A (CENP-A) can replace histone 3 in nucleosome reconstitution <it>in vitro</it></p>
				</title>
				<aug>
					<au>
						<snm>Yoda</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Ando</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Morishita</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Houmura</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Hashimoto</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Takeyasu</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Okazaki</snm>
						<fnm>T</fnm>
					</au>
				</aug>
				<source>Proc Natl Acad Sci USA</source>
				<pubdate>2000</pubdate>
				<volume>97</volume>
				<fpage>7266</fpage>
				<lpage>7271</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">16534</pubid>
						<pubid idtype="pmpid" link="fulltext">10840064</pubid>
						<pubid idtype="doi">10.1073/pnas.130189697</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B9">
				<title>
					<p>Adaptive evolution of Cid, a centromere-specific histone in <it>Drosophila</it></p>
				</title>
				<aug>
					<au>
						<snm>Malik</snm>
						<fnm>HS</fnm>
					</au>
					<au>
						<snm>Henikoff</snm>
						<fnm>S</fnm>
					</au>
				</aug>
				<source>Genetics</source>
				<pubdate>2001</pubdate>
				<volume>157</volume>
				<fpage>1293</fpage>
				<lpage>1298</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">11238413</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B10">
				<title>
					<p>Centromeric localization and adaptive evolution of an <it>Arabidopsis </it>histone H3 variant</p>
				</title>
				<aug>
					<au>
						<snm>Talbert</snm>
						<fnm>PB</fnm>
					</au>
					<au>
						<snm>Masuelli</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Tyagi</snm>
						<fnm>AP</fnm>
					</au>
					<au>
						<snm>Comai</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Henikoff</snm>
						<fnm>S</fnm>
					</au>
				</aug>
				<source>Plant Cell</source>
				<pubdate>2002</pubdate>
				<volume>14</volume>
				<fpage>1053</fpage>
				<lpage>1066</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">150606</pubid>
						<pubid idtype="pmpid" link="fulltext">12034896</pubid>
						<pubid idtype="doi">10.1105/tpc.010425</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B11">
				<title>
					<p>Adaptive evolution of the histone fold domain in centromeric histones</p>
				</title>
				<aug>
					<au>
						<snm>Cooper</snm>
						<fnm>JL</fnm>
					</au>
					<au>
						<snm>Henikoff</snm>
						<fnm>S</fnm>
					</au>
				</aug>
				<source>Mol Biol Evol</source>
				<pubdate>2004</pubdate>
				<volume>21</volume>
				<fpage>1712</fpage>
				<lpage>1718</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/molbev/msh179</pubid>
						<pubid idtype="pmpid" link="fulltext">15175412</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B12">
				<title>
					<p>Recurrent evolution of DNA-binding motifs in the <it>Drosophila </it>centromeric histone</p>
				</title>
				<aug>
					<au>
						<snm>Malik</snm>
						<fnm>HS</fnm>
					</au>
					<au>
						<snm>Vermaak</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Henikoff</snm>
						<fnm>S</fnm>
					</au>
				</aug>
				<source>Proc Natl Acad Sci USA</source>
				<pubdate>2002</pubdate>
				<volume>99</volume>
				<fpage>1449</fpage>
				<lpage>1454</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">122211</pubid>
						<pubid idtype="pmpid" link="fulltext">11805302</pubid>
						<pubid idtype="doi">10.1073/pnas.032664299</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B13">
				<title>
					<p>CENP-C, an autoantigen in scleroderma, is a component of the human inner kinetochore plate</p>
				</title>
				<aug>
					<au>
						<snm>Saitoh</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Tomkiel</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Cooke</snm>
						<fnm>CA</fnm>
					</au>
					<au>
						<snm>Ratrie</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Maurer</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Rothfield</snm>
						<fnm>NF</fnm>
					</au>
					<au>
						<snm>Earnshaw</snm>
						<fnm>WC</fnm>
					</au>
				</aug>
				<source>Cell</source>
				<pubdate>1992</pubdate>
				<volume>70</volume>
				<fpage>115</fpage>
				<lpage>125</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/0092-8674(92)90538-N</pubid>
						<pubid idtype="pmpid">1339310</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B14">
				<title>
					<p>Sequence homologies and linkage group conservation of the human and mouse <it>Cenpc </it>genes</p>
				</title>
				<aug>
					<au>
						<snm>McKay</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Thomson</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Cooke</snm>
						<fnm>H</fnm>
					</au>
				</aug>
				<source>Genomics</source>
				<pubdate>1994</pubdate>
				<volume>22</volume>
				<fpage>36</fpage>
				<lpage>40</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1006/geno.1994.1342</pubid>
						<pubid idtype="pmpid" link="fulltext">7959789</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B15">
				<title>
					<p>Sheep CENPB and CENPC genes show a high level of sequence similarity and conserved synteny with their human homologs</p>
				</title>
				<aug>
					<au>
						<snm>Burkin</snm>
						<fnm>DJ</fnm>
					</au>
					<au>
						<snm>Jones</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Burkin</snm>
						<fnm>HR</fnm>
					</au>
					<au>
						<snm>McGrew</snm>
						<fnm>JA</fnm>
					</au>
					<au>
						<snm>Broad</snm>
						<fnm>TE</fnm>
					</au>
				</aug>
				<source>Cytogenet Cell Genet</source>
				<pubdate>1996</pubdate>
				<volume>74</volume>
				<fpage>86</fpage>
				<lpage>89</lpage>
				<xrefbib>
					<pubid idtype="pmpid">8893808</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B16">
				<title>
					<p>Efficient conditional mutation of the vertebrate CENP-C gene</p>
				</title>
				<aug>
					<au>
						<snm>Fukagawa</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Brown</snm>
						<fnm>WR</fnm>
					</au>
				</aug>
				<source>Hum Mol Genet</source>
				<pubdate>1997</pubdate>
				<volume>6</volume>
				<fpage>2301</fpage>
				<lpage>2308</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/hmg/6.13.2301</pubid>
						<pubid idtype="pmpid" link="fulltext">9361037</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B17">
				<title>
					<p>Characterization of internal DNA-binding and C-terminal dimerization domains of human centromere/kinetochore autoantigen CENP-C <it>in vitro </it>: role of DNA-binding and self-associating activities in kinetochore organization</p>
				</title>
				<aug>
					<au>
						<snm>Sugimoto</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Kuriyama</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Shibata</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Himeno</snm>
						<fnm>M</fnm>
					</au>
				</aug>
				<source>Chromosome Res</source>
				<pubdate>1997</pubdate>
				<volume>5</volume>
				<fpage>132</fpage>
				<lpage>141</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1023/A:1018422325569</pubid>
						<pubid idtype="pmpid">9146917</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B18">
				<title>
					<p>Human centromere protein C (CENP-C) is a DNA-binding protein which possesses a novel DNA-binding motif</p>
				</title>
				<aug>
					<au>
						<snm>Sugimoto</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Yata</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Muro</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Himeno</snm>
						<fnm>M</fnm>
					</au>
				</aug>
				<source>J Biochem</source>
				<pubdate>1994</pubdate>
				<volume>116</volume>
				<fpage>877</fpage>
				<lpage>881</lpage>
				<xrefbib>
					<pubid idtype="pmpid">7883764</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B19">
				<title>
					<p>Identification of overlapping DNA-binding and centromere-targeting domains in the human kinetochore protein CENP-C</p>
				</title>
				<aug>
					<au>
						<snm>Yang</snm>
						<fnm>CH</fnm>
					</au>
					<au>
						<snm>Tomkiel</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Saitoh</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Johnson</snm>
						<fnm>DH</fnm>
					</au>
					<au>
						<snm>Earnshaw</snm>
						<fnm>WC</fnm>
					</au>
				</aug>
				<source>Mol Cell Biol</source>
				<pubdate>1996</pubdate>
				<volume>16</volume>
				<fpage>3576</fpage>
				<lpage>3586</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">231353</pubid>
						<pubid idtype="pmpid" link="fulltext">8668174</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B20">
				<title>
					<p>CENP-C binds the alpha-satellite DNA <it>in vivo </it>at specific centromere domains</p>
				</title>
				<aug>
					<au>
						<snm>Politi</snm>
						<fnm>V</fnm>
					</au>
					<au>
						<snm>Perini</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Trazzi</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Pliss</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Raska</snm>
						<fnm>I</fnm>
					</au>
					<au>
						<snm>Earnshaw</snm>
						<fnm>WC</fnm>
					</au>
					<au>
						<snm>Della Valle</snm>
						<fnm>G</fnm>
					</au>
				</aug>
				<source>J Cell Sci</source>
				<pubdate>2002</pubdate>
				<volume>115</volume>
				<fpage>2317</fpage>
				<lpage>2327</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">12006616</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B21">
				<title>
					<p><it>In vivo </it>functional dissection of human inner kinetochore protein CENP-C</p>
				</title>
				<aug>
					<au>
						<snm>Trazzi</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Bernardoni</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Diolaiti</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Politi</snm>
						<fnm>V</fnm>
					</au>
					<au>
						<snm>Earnshaw</snm>
						<fnm>WC</fnm>
					</au>
					<au>
						<snm>Perini</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Della Valle</snm>
						<fnm>G</fnm>
					</au>
				</aug>
				<source>J Struct Biol</source>
				<pubdate>2002</pubdate>
				<volume>140</volume>
				<fpage>39</fpage>
				<lpage>48</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/S1047-8477(02)00506-3</pubid>
						<pubid idtype="pmpid" link="fulltext">12490152</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B22">
				<title>
					<p>Sequence similarities between the yeast chromosome segregation protein Mif2 and the mammalian centromere protein CENP-C</p>
				</title>
				<aug>
					<au>
						<snm>Brown</snm>
						<fnm>MT</fnm>
					</au>
				</aug>
				<source>Gene</source>
				<pubdate>1995</pubdate>
				<volume>160</volume>
				<fpage>111</fpage>
				<lpage>116</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/0378-1119(95)00163-Z</pubid>
						<pubid idtype="pmpid" link="fulltext">7628703</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B23">
				<title>
					<p>Evidence that the <it>MIF2 </it>gene of <it>Saccharomyces cerevisiae </it>encodes a centromere protein with homology to the mammalian centromere protein CENP-C</p>
				</title>
				<aug>
					<au>
						<snm>Meluh</snm>
						<fnm>PB</fnm>
					</au>
					<au>
						<snm>Koshland</snm>
						<fnm>D</fnm>
					</au>
				</aug>
				<source>Mol Biol Cell</source>
				<pubdate>1995</pubdate>
				<volume>6</volume>
				<fpage>793</fpage>
				<lpage>807</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">301241</pubid>
						<pubid idtype="pmpid">7579695</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B24">
				<title>
					<p>HCP-4, a CENP-C-like protein in <it>Caenorhabditis elegans</it>, is required for resolution of sister centromeres</p>
				</title>
				<aug>
					<au>
						<snm>Moore</snm>
						<fnm>LL</fnm>
					</au>
					<au>
						<snm>Roth</snm>
						<fnm>MB</fnm>
					</au>
				</aug>
				<source>J Cell Biol</source>
				<pubdate>2001</pubdate>
				<volume>153</volume>
				<fpage>1199</fpage>
				<lpage>1208</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1083/jcb.153.6.1199</pubid>
						<pubid idtype="pmpid" link="fulltext">11402064</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B25">
				<title>
					<p>A maize homolog of mammalian CENPC is a constitutive component of the inner kinetochore</p>
				</title>
				<aug>
					<au>
						<snm>Dawe</snm>
						<fnm>RK</fnm>
					</au>
					<au>
						<snm>Reed</snm>
						<fnm>LM</fnm>
					</au>
					<au>
						<snm>Yu</snm>
						<fnm>HG</fnm>
					</au>
					<au>
						<snm>Muszynski</snm>
						<fnm>MG</fnm>
					</au>
					<au>
						<snm>Hiatt</snm>
						<fnm>EN</fnm>
					</au>
				</aug>
				<source>Plant Cell</source>
				<pubdate>1999</pubdate>
				<volume>11</volume>
				<fpage>1227</fpage>
				<lpage>1238</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">144275</pubid>
						<pubid idtype="pmpid" link="fulltext">10402425</pubid>
						<pubid idtype="doi">10.1105/tpc.11.7.1227</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B26">
				<title>
					<p>MIF2 is required for mitotic spindle integrity during anaphase spindle elongation in <it>Saccharomyces cerevisiae</it></p>
				</title>
				<aug>
					<au>
						<snm>Brown</snm>
						<fnm>MT</fnm>
					</au>
					<au>
						<snm>Goetsch</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Hartwell</snm>
						<fnm>LH</fnm>
					</au>
				</aug>
				<source>J Cell Biol</source>
				<pubdate>1993</pubdate>
				<volume>123</volume>
				<fpage>387</fpage>
				<lpage>403</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1083/jcb.123.2.387</pubid>
						<pubid idtype="pmpid" link="fulltext">8408221</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B27">
				<title>
					<p>CENP-C is required for maintaining proper kinetochore size and for a timely transition to anaphase</p>
				</title>
				<aug>
					<au>
						<snm>Tomkiel</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Cooke</snm>
						<fnm>CA</fnm>
					</au>
					<au>
						<snm>Saitoh</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Bernat</snm>
						<fnm>RL</fnm>
					</au>
					<au>
						<snm>Earnshaw</snm>
						<fnm>WC</fnm>
					</au>
				</aug>
				<source>J Cell Biol</source>
				<pubdate>1994</pubdate>
				<volume>125</volume>
				<fpage>531</fpage>
				<lpage>545</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1083/jcb.125.3.531</pubid>
						<pubid idtype="pmpid" link="fulltext">8175879</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B28">
				<title>
					<p>Targeted disruption of mouse centromere protein C gene leads to mitotic disarray and early embryo death</p>
				</title>
				<aug>
					<au>
						<snm>Kalitsis</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Fowler</snm>
						<fnm>KJ</fnm>
					</au>
					<au>
						<snm>Earle</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Hill</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Choo</snm>
						<fnm>KH</fnm>
					</au>
				</aug>
				<source>Proc Natl Acad Sci USA</source>
				<pubdate>1998</pubdate>
				<volume>95</volume>
				<fpage>1136</fpage>
				<lpage>1141</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">18698</pubid>
						<pubid idtype="pmpid" link="fulltext">9448298</pubid>
						<pubid idtype="doi">10.1073/pnas.95.3.1136</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B29">
				<title>
					<p>Gene structure and sequence analysis of mouse centromere proteins A and C</p>
				</title>
				<aug>
					<au>
						<snm>Kalitsis</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>MacDonald</snm>
						<fnm>AC</fnm>
					</au>
					<au>
						<snm>Newson</snm>
						<fnm>AJ</fnm>
					</au>
					<au>
						<snm>Hudson</snm>
						<fnm>DF</fnm>
					</au>
					<au>
						<snm>Choo</snm>
						<fnm>KH</fnm>
					</au>
				</aug>
				<source>Genomics</source>
				<pubdate>1998</pubdate>
				<volume>47</volume>
				<fpage>108</fpage>
				<lpage>114</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1006/geno.1997.5109</pubid>
						<pubid idtype="pmpid" link="fulltext">9465302</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B30">
				<title>
					<p>NCBI High-throughput genomic sequences</p>
				</title>
				<url>http://www.ncbi.nlm.nih.gov/HTGS/index.html</url>
			</bibl>
			<bibl id="B31">
				<title>
					<p>K-Estimator: calculation of the number of nucleotide substitutions per site and the confidence intervals</p>
				</title>
				<aug>
					<au>
						<snm>Comeron</snm>
						<fnm>JM</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>1999</pubdate>
				<volume>15</volume>
				<fpage>763</fpage>
				<lpage>764</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmpid" link="fulltext">10498777</pubid>
						<pubid idtype="doi">10.1093/bioinformatics/15.9.763</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B32">
				<title>
					<p>Molecular cloning and sequence analysis of hamster CENP-A cDNA</p>
				</title>
				<aug>
					<au>
						<snm>Figueroa</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Pendon</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Valdivia</snm>
						<fnm>MM</fnm>
					</au>
				</aug>
				<source>BMC Genomics</source>
				<pubdate>2002</pubdate>
				<volume>3</volume>
				<fpage>11</fpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">113255</pubid>
						<pubid idtype="pmpid" link="fulltext">12019018</pubid>
						<pubid idtype="doi">10.1186/1471-2164-3-11</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B33">
				<title>
					<p>Human CENP-A contains a histone H3 related histone fold that is required for targeting to the centromere</p>
				</title>
				<aug>
					<au>
						<snm>Sullivan</snm>
						<fnm>KF</fnm>
					</au>
					<au>
						<snm>Hechenberger</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Masri</snm>
						<fnm>K</fnm>
					</au>
				</aug>
				<source>J Cell Biol</source>
				<pubdate>1994</pubdate>
				<volume>127</volume>
				<fpage>581</fpage>
				<lpage>592</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1083/jcb.127.3.581</pubid>
						<pubid idtype="pmpid" link="fulltext">7962047</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B34">
				<title>
					<p>Chimpanzee Genome</p>
				</title>
				<url>http://www.genome.wustl.edu/projects/chimp</url>
			</bibl>
			<bibl id="B35">
				<title>
					<p>NCBI BLAST</p>
				</title>
				<url>http://www.ncbi.nlm.nih.gov/BLAST</url>
			</bibl>
			<bibl id="B36">
				<title>
					<p>Centromeric retroelements and satellites interact with maize kinetochore protein CENH3</p>
				</title>
				<aug>
					<au>
						<snm>Zhong</snm>
						<fnm>CX</fnm>
					</au>
					<au>
						<snm>Marshall</snm>
						<fnm>JB</fnm>
					</au>
					<au>
						<snm>Topp</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Mroczek</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Kato</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Nagaki</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Birchler</snm>
						<fnm>JA</fnm>
					</au>
					<au>
						<snm>Jiang</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Dawe</snm>
						<fnm>RK</fnm>
					</au>
				</aug>
				<source>Plant Cell</source>
				<pubdate>2002</pubdate>
				<volume>14</volume>
				<fpage>2825</fpage>
				<lpage>2836</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">152730</pubid>
						<pubid idtype="pmpid" link="fulltext">12417704</pubid>
						<pubid idtype="doi">10.1105/tpc.006106</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B37">
				<title>
					<p>Analysis and functional annotation of an expressed sequence tag collection for tropical crop sugarcane</p>
				</title>
				<aug>
					<au>
						<snm>Vettore</snm>
						<fnm>AL</fnm>
					</au>
					<au>
						<snm>da Silva</snm>
						<fnm>FR</fnm>
					</au>
					<au>
						<snm>Kemper</snm>
						<fnm>EL</fnm>
					</au>
					<au>
						<snm>Souza</snm>
						<fnm>GM</fnm>
					</au>
					<au>
						<snm>da Silva</snm>
						<fnm>AM</fnm>
					</au>
					<au>
						<snm>Ferro</snm>
						<fnm>MI</fnm>
					</au>
					<au>
						<snm>Henrique-Silva</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Giglioti</snm>
						<fnm>EA</fnm>
					</au>
					<au>
						<snm>Lemos</snm>
						<fnm>MV</fnm>
					</au>
					<au>
						<snm>Coutinho</snm>
						<fnm>LL</fnm>
					</au>
					<etal/>
				</aug>
				<source>Genome Res</source>
				<pubdate>2003</pubdate>
				<volume>13</volume>
				<fpage>2725</fpage>
				<lpage>2735</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">403815</pubid>
						<pubid idtype="pmpid" link="fulltext">14613979</pubid>
						<pubid idtype="doi">10.1101/gr.1532103</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B38">
				<title>
					<p>Cloning and characterization of centromere specific DNA element from <it>Sorghum bicolor</it></p>
				</title>
				<aug>
					<au>
						<snm>Miller</snm>
						<fnm>JT</fnm>
					</au>
					<au>
						<snm>Jackson</snm>
						<fnm>SA</fnm>
					</au>
					<au>
						<snm>Nasuda</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Gill</snm>
						<fnm>BS</fnm>
					</au>
					<au>
						<snm>Wing</snm>
						<fnm>RA</fnm>
					</au>
					<au>
						<snm>Jiang</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<source>Theor Appl Genet</source>
				<pubdate>1998</pubdate>
				<volume>96</volume>
				<fpage>832</fpage>
				<lpage>839</lpage>
				<xrefbib>
					<pubid idtype="doi">10.1007/s001220050809</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B39">
				<title>
					<p>FISH mapping of centromere protein C (CENPC) on human chromosome 4q31-q21</p>
				</title>
				<aug>
					<au>
						<snm>Xie</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Heng</snm>
						<fnm>HH</fnm>
					</au>
				</aug>
				<source>Cytogenet Cell Genet</source>
				<pubdate>1996</pubdate>
				<volume>74</volume>
				<fpage>192</fpage>
				<lpage>193</lpage>
				<xrefbib>
					<pubid idtype="pmpid">8941372</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B40">
				<title>
					<p>PAML: a program package for phylogenetic analysis by maximum likelihood</p>
				</title>
				<aug>
					<au>
						<snm>Yang</snm>
						<fnm>Z</fnm>
					</au>
				</aug>
				<source>Comput Appl Biosci</source>
				<pubdate>1997</pubdate>
				<volume>13</volume>
				<fpage>555</fpage>
				<lpage>556</lpage>
				<xrefbib>
					<pubid idtype="pmpid">9367129</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B41">
				<title>
					<p>CENP-B interacts with CENP-C domains containing Mif2 regions responsible for centromere localization</p>
				</title>
				<aug>
					<au>
						<snm>Suzuki</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Nakano</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Nozaki</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Egashira</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Okazaki</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Masumoto</snm>
						<fnm>H</fnm>
					</au>
				</aug>
				<source>J Biol Chem</source>
				<pubdate>2004</pubdate>
				<volume>279</volume>
				<fpage>5934</fpage>
				<lpage>5946</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1074/jbc.M306477200</pubid>
						<pubid idtype="pmpid" link="fulltext">14612452</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B42">
				<title>
					<p>Mutational analysis of the central centromere targeting domain of human centromere protein C, (CENP-C)</p>
				</title>
				<aug>
					<au>
						<snm>Song</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Gronemeyer</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Lu</snm>
						<fnm>W</fnm>
					</au>
					<au>
						<snm>Eugster</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Tomkiel</snm>
						<fnm>JE</fnm>
					</au>
				</aug>
				<source>Exp Cell Res</source>
				<pubdate>2002</pubdate>
				<volume>275</volume>
				<fpage>81</fpage>
				<lpage>91</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1006/excr.2002.5495</pubid>
						<pubid idtype="pmpid" link="fulltext">11925107</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B43">
				<title>
					<p>Domains required for CENP-C assembly at the kinetochore</p>
				</title>
				<aug>
					<au>
						<snm>Lanini</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>McKeon</snm>
						<fnm>F</fnm>
					</au>
				</aug>
				<source>Mol Biol Cell</source>
				<pubdate>1995</pubdate>
				<volume>6</volume>
				<fpage>1049</fpage>
				<lpage>1059</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">301262</pubid>
						<pubid idtype="pmpid">7579707</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B44">
				<title>
					<p>Centromere targeting element within the histone fold domain of Cid</p>
				</title>
				<aug>
					<au>
						<snm>Vermaak</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Hayden</snm>
						<fnm>HS</fnm>
					</au>
					<au>
						<snm>Henikoff</snm>
						<fnm>S</fnm>
					</au>
				</aug>
				<source>Mol Cell Biol</source>
				<pubdate>2002</pubdate>
				<volume>22</volume>
				<fpage>7553</fpage>
				<lpage>7561</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">135675</pubid>
						<pubid idtype="pmpid" link="fulltext">12370302</pubid>
						<pubid idtype="doi">10.1128/MCB.22.21.7553-7561.2002</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B45">
				<title>
					<p>Human centromere chromatin protein hMis12, essential for equal segregation, is independent of CENP-A loading pathway</p>
				</title>
				<aug>
					<au>
						<snm>Goshima</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Kiyomitsu</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Yoda</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Yanagida</snm>
						<fnm>M</fnm>
					</au>
				</aug>
				<source>J Cell Biol</source>
				<pubdate>2003</pubdate>
				<volume>160</volume>
				<fpage>25</fpage>
				<lpage>39</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1083/jcb.200210005</pubid>
						<pubid idtype="pmpid" link="fulltext">12515822</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B46">
				<title>
					<p>Early disruption of centromeric chromatin organization in centromere protein A (CenpA) null mice</p>
				</title>
				<aug>
					<au>
						<snm>Howman</snm>
						<fnm>EV</fnm>
					</au>
					<au>
						<snm>Fowler</snm>
						<fnm>KJ</fnm>
					</au>
					<au>
						<snm>Newson</snm>
						<fnm>AJ</fnm>
					</au>
					<au>
						<snm>Redward</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>MacDonald</snm>
						<fnm>AC</fnm>
					</au>
					<au>
						<snm>Kalitsis</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Choo</snm>
						<fnm>KH</fnm>
					</au>
				</aug>
				<source>Proc Natl Acad Sci USA</source>
				<pubdate>2000</pubdate>
				<volume>97</volume>
				<fpage>1148</fpage>
				<lpage>1153</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">15551</pubid>
						<pubid idtype="pmpid" link="fulltext">10655499</pubid>
						<pubid idtype="doi">10.1073/pnas.97.3.1148</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B47">
				<title>
					<p>Functional analysis of kinetochore assembly in <it>Caenorhabditis elegans</it></p>
				</title>
				<aug>
					<au>
						<snm>Oegema</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Desai</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Rybina</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Kirkham</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Hyman</snm>
						<fnm>AA</fnm>
					</au>
				</aug>
				<source>J Cell Biol</source>
				<pubdate>2001</pubdate>
				<volume>153</volume>
				<fpage>1209</fpage>
				<lpage>1226</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1083/jcb.153.6.1209</pubid>
						<pubid idtype="pmpid" link="fulltext">11402065</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B48">
				<title>
					<p>Specification of kinetochore-forming chromatin by the histone H3 variant CENP-A</p>
				</title>
				<aug>
					<au>
						<snm>Van Hooser</snm>
						<fnm>AA</fnm>
					</au>
					<au>
						<snm>Ouspenski</snm>
						<fnm>II</fnm>
					</au>
					<au>
						<snm>Gregson</snm>
						<fnm>HC</fnm>
					</au>
					<au>
						<snm>Starr</snm>
						<fnm>DA</fnm>
					</au>
					<au>
						<snm>Yen</snm>
						<fnm>TJ</fnm>
					</au>
					<au>
						<snm>Goldberg</snm>
						<fnm>ML</fnm>
					</au>
					<au>
						<snm>Yokomori</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Earnshaw</snm>
						<fnm>WC</fnm>
					</au>
					<au>
						<snm>Sullivan</snm>
						<fnm>KF</fnm>
					</au>
					<au>
						<snm>Brinkley</snm>
						<fnm>BR</fnm>
					</au>
				</aug>
				<source>J Cell Sci</source>
				<pubdate>2001</pubdate>
				<volume>114</volume>
				<fpage>3529</fpage>
				<lpage>3542</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">11682612</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B49">
				<title>
					<p>Architecture of the budding yeast kinetochore reveals a conserved molecular core</p>
				</title>
				<aug>
					<au>
						<snm>Westermann</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Cheeseman</snm>
						<fnm>IM</fnm>
					</au>
					<au>
						<snm>Anderson</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Yates</snm>
						<fnm>JR</fnm>
						<suf>3rd</suf>
					</au>
					<au>
						<snm>Drubin</snm>
						<fnm>DG</fnm>
					</au>
					<au>
						<snm>Barnes</snm>
						<fnm>G</fnm>
					</au>
				</aug>
				<source>J Cell Biol</source>
				<pubdate>2003</pubdate>
				<volume>163</volume>
				<fpage>215</fpage>
				<lpage>222</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1083/jcb.200305100</pubid>
						<pubid idtype="pmpid" link="fulltext">14581449</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B50">
				<title>
					<p>Crystal structure of the nucleosome core particle at 2.8 &#197; resolution</p>
				</title>
				<aug>
					<au>
						<snm>Luger</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Mader</snm>
						<fnm>AW</fnm>
					</au>
					<au>
						<snm>Richmond</snm>
						<fnm>RK</fnm>
					</au>
					<au>
						<snm>Sargent</snm>
						<fnm>DF</fnm>
					</au>
					<au>
						<snm>Richmond</snm>
						<fnm>TJ</fnm>
					</au>
				</aug>
				<source>Nature</source>
				<pubdate>1997</pubdate>
				<volume>389</volume>
				<fpage>251</fpage>
				<lpage>260</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/38444</pubid>
						<pubid idtype="pmpid" link="fulltext">9305837</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B51">
				<title>
					<p>Gene structure, chromosomal localization and immunolocalization of chicken centromere proteins CENP-C and ZW10</p>
				</title>
				<aug>
					<au>
						<snm>Okamura</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Pendon</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Valdivia</snm>
						<fnm>MM</fnm>
					</au>
					<au>
						<snm>Ikemura</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Fukagawa</snm>
						<fnm>T</fnm>
					</au>
				</aug>
				<source>Gene</source>
				<pubdate>2001</pubdate>
				<volume>262</volume>
				<fpage>283</fpage>
				<lpage>290</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/S0378-1119(00)00517-5</pubid>
						<pubid idtype="pmpid" link="fulltext">11179694</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B52">
				<title>
					<p>Differential localization of the centromere-specific proteins in the major centromeric satellite of <it>Arabidopsis thaliana</it></p>
				</title>
				<aug>
					<au>
						<snm>Shibata</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Murata</snm>
						<fnm>M</fnm>
					</au>
				</aug>
				<source>J Cell Sci</source>
				<pubdate>2004</pubdate>
				<volume>117</volume>
				<fpage>2963</fpage>
				<lpage>2970</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1242/jcs.01144</pubid>
						<pubid idtype="pmpid" link="fulltext">15161939</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B53">
				<title>
					<p>Episodic adaptive evolution of primate lysozymes</p>
				</title>
				<aug>
					<au>
						<snm>Messier</snm>
						<fnm>W</fnm>
					</au>
					<au>
						<snm>Stewart</snm>
						<fnm>CB</fnm>
					</au>
				</aug>
				<source>Nature</source>
				<pubdate>1997</pubdate>
				<volume>385</volume>
				<fpage>151</fpage>
				<lpage>154</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/385151a0</pubid>
						<pubid idtype="pmpid">8990116</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B54">
				<title>
					<p>Likelihood ratio tests for detecting positive selection and application to primate lysozyme evolution</p>
				</title>
				<aug>
					<au>
						<snm>Yang</snm>
						<fnm>Z</fnm>
					</au>
				</aug>
				<source>Mol Biol Evol</source>
				<pubdate>1998</pubdate>
				<volume>15</volume>
				<fpage>568</fpage>
				<lpage>573</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">9580986</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B55">
				<title>
					<p>Sequencing and comparison of yeast species to identify genes and regulatory elements</p>
				</title>
				<aug>
					<au>
						<snm>Kellis</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Patterson</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Endrizzi</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Birren</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Lander</snm>
						<fnm>ES</fnm>
					</au>
				</aug>
				<source>Nature</source>
				<pubdate>2003</pubdate>
				<volume>423</volume>
				<fpage>241</fpage>
				<lpage>254</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/nature01644</pubid>
						<pubid idtype="pmpid" link="fulltext">12748633</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B56">
				<title>
					<p>Nonrandom segregation during meiosis: the unfairness of females</p>
				</title>
				<aug>
					<au>
						<snm>Pardo-Manuel de Villena</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Sapienza</snm>
						<fnm>C</fnm>
					</au>
				</aug>
				<source>Mamm Genome</source>
				<pubdate>2001</pubdate>
				<volume>12</volume>
				<fpage>331</fpage>
				<lpage>339</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1007/s003350040003</pubid>
						<pubid idtype="pmpid" link="fulltext">11331939</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B57">
				<title>
					<p>Functional complementation of human centromere protein A (CENP-A) by Cse4p from <it>Saccharomyces cerevisiae</it></p>
				</title>
				<aug>
					<au>
						<snm>Wieland</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Orthaus</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Ohndorf</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Diekmann</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Hemmerich</snm>
						<fnm>P</fnm>
					</au>
				</aug>
				<source>Mol Cell Biol</source>
				<pubdate>2004</pubdate>
				<volume>24</volume>
				<fpage>6620</fpage>
				<lpage>6630</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1128/MCB.24.15.6620-6630.2004</pubid>
						<pubid idtype="pmpid" link="fulltext">15254229</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B58">
				<title>
					<p>Distortion of female meiotic segregation and reduced male fertility in human Robertsonian translocations: consistent with the centromere model of co-evolving centromere DNA/centromeric histone (CENP-A)</p>
				</title>
				<aug>
					<au>
						<snm>Daniel</snm>
						<fnm>A</fnm>
					</au>
				</aug>
				<source>Am J Med Genet</source>
				<pubdate>2002</pubdate>
				<volume>111</volume>
				<fpage>450</fpage>
				<lpage>452</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1002/ajmg.10618</pubid>
						<pubid idtype="pmpid" link="fulltext">12210311</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B59">
				<title>
					<p>Female meiosis drives karyotypic evolution in mammals</p>
				</title>
				<aug>
					<au>
						<snm>Pardo-Manuel de Villena</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Sapienza</snm>
						<fnm>C</fnm>
					</au>
				</aug>
				<source>Genetics</source>
				<pubdate>2001</pubdate>
				<volume>159</volume>
				<fpage>1179</fpage>
				<lpage>1189</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">11729161</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B60">
				<title>
					<p>B chromosomes are more frequent in mammals with acrocentric karyotypes: support for the theory of centromeric drive</p>
				</title>
				<aug>
					<au>
						<snm>Palestis</snm>
						<fnm>BG</fnm>
					</au>
					<au>
						<snm>Burt</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Jones</snm>
						<fnm>RN</fnm>
					</au>
					<au>
						<snm>Trivers</snm>
						<fnm>R</fnm>
					</au>
				</aug>
				<source>Proc R Soc Lond B Biol Sci</source>
				<pubdate>2004</pubdate>
				<volume>271</volume>
				<fpage>S22</fpage>
				<lpage>S24</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1098/rsbl.2003.0084</pubid>
						<pubid idtype="pmpid">15101408</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B61">
				<title>
					<p><it>Drosophila melanogaster </it>and <it>D. simulans </it>rescue strains produce fit offspring, despite divergent centromere-specific histone alleles</p>
				</title>
				<aug>
					<au>
						<snm>Sainz</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Wilder</snm>
						<fnm>JA</fnm>
					</au>
					<au>
						<snm>Wolf</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Hollocher</snm>
						<fnm>H</fnm>
					</au>
				</aug>
				<source>Heredity</source>
				<pubdate>2003</pubdate>
				<volume>91</volume>
				<fpage>28</fpage>
				<lpage>35</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/sj.hdy.6800275</pubid>
						<pubid idtype="pmpid" link="fulltext">12815450</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B62">
				<aug>
					<au>
						<snm>Coyne</snm>
						<fnm>JA</fnm>
					</au>
					<au>
						<snm>Orr</snm>
						<fnm>HA</fnm>
					</au>
				</aug>
				<source>Speciation</source>
				<publisher>Sunderland: Sinauer</publisher>
				<pubdate>2004</pubdate>
			</bibl>
			<bibl id="B63">
				<title>
					<p>Genome evolution in yeasts</p>
				</title>
				<aug>
					<au>
						<snm>Dujon</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Sherman</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Fischer</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Durrens</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Casaregola</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Lafontaine</snm>
						<fnm>I</fnm>
					</au>
					<au>
						<snm>De Montigny</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Marck</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Neuveglise</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Talla</snm>
						<fnm>E</fnm>
					</au>
					<etal/>
				</aug>
				<source>Nature</source>
				<pubdate>2004</pubdate>
				<volume>430</volume>
				<fpage>35</fpage>
				<lpage>44</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/nature02579</pubid>
						<pubid idtype="pmpid" link="fulltext">15229592</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B64">
				<title>
					<p>A DNA methyltransferase homolog with a chromodomain exists in multiple forms in <it>Arabidopsis</it></p>
				</title>
				<aug>
					<au>
						<snm>Henikoff</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Comai</snm>
						<fnm>L</fnm>
					</au>
				</aug>
				<source>Genetics</source>
				<pubdate>1998</pubdate>
				<volume>149</volume>
				<fpage>307</fpage>
				<lpage>318</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">9584105</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B65">
				<title>
					<p>Primer3 on the WWW for general users and for biologist programmers</p>
				</title>
				<aug>
					<au>
						<snm>Rozen</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Skaletsky</snm>
						<fnm>H</fnm>
					</au>
				</aug>
				<source>Methods Mol Biol</source>
				<pubdate>2000</pubdate>
				<volume>132</volume>
				<fpage>365</fpage>
				<lpage>386</lpage>
				<xrefbib>
					<pubid idtype="pmpid">10547847</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B66">
				<title>
					<p>Gene Codes Corporation</p>
				</title>
				<url>http://www.genecodes.com</url>
			</bibl>
			<bibl id="B67">
				<title>
					<p>Basic local alignment search tool</p>
				</title>
				<aug>
					<au>
						<snm>Altschul</snm>
						<fnm>SF</fnm>
					</au>
					<au>
						<snm>Gish</snm>
						<fnm>W</fnm>
					</au>
					<au>
						<snm>Miller</snm>
						<fnm>W</fnm>
					</au>
					<au>
						<snm>Myers</snm>
						<fnm>EW</fnm>
					</au>
					<au>
						<snm>Lipman</snm>
						<fnm>DJ</fnm>
					</au>
				</aug>
				<source>J Mol Biol</source>
				<pubdate>1990</pubdate>
				<volume>215</volume>
				<fpage>403</fpage>
				<lpage>410</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1006/jmbi.1990.9999</pubid>
						<pubid idtype="pmpid" link="fulltext">2231712</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B68">
				<title>
					<p>Gramene</p>
				</title>
				<url>http://www.gramene.org</url>
			</bibl>
			<bibl id="B69">
				<title>
					<p>TIGR Gene Indices</p>
				</title>
				<url>http://www.tigr.org/tdb/tgi</url>
			</bibl>
			<bibl id="B70">
				<title>
					<p>The sequence manipulation suite: JavaScript programs for analyzing and formatting protein and DNA sequences</p>
				</title>
				<aug>
					<au>
						<snm>Stothard</snm>
						<fnm>P</fnm>
					</au>
				</aug>
				<source>Biotechniques</source>
				<pubdate>2000</pubdate>
				<volume>28</volume>
				<fpage>1102</fpage>
				<lpage>1104</lpage>
				<xrefbib>
					<pubid idtype="pmpid">10868275</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B71">
				<title>
					<p>Translate</p>
				</title>
				<url>http://www.ualberta.ca/~stothard/javascript/translate.html</url>
			</bibl>
			<bibl id="B72">
				<title>
					<p>CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice</p>
				</title>
				<aug>
					<au>
						<snm>Thompson</snm>
						<fnm>JD</fnm>
					</au>
					<au>
						<snm>Higgins</snm>
						<fnm>DG</fnm>
					</au>
					<au>
						<snm>Gibson</snm>
						<fnm>TJ</fnm>
					</au>
				</aug>
				<source>Nucleic Acids Res</source>
				<pubdate>1994</pubdate>
				<volume>22</volume>
				<fpage>4673</fpage>
				<lpage>4680</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">308517</pubid>
						<pubid idtype="pmpid">7984417</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B73">
				<title>
					<p>EMBL/EBI ClustalW</p>
				</title>
				<url>http://www.ebi.ac.uk/clustalw</url>
			</bibl>
			<bibl id="B74">
				<title>
					<p>Searching databases of conserved sequence regions by aligning protein multiple-alignments</p>
				</title>
				<aug>
					<au>
						<snm>Pietrokovski</snm>
						<fnm>S</fnm>
					</au>
				</aug>
				<source>Nucleic Acids Res</source>
				<pubdate>1996</pubdate>
				<volume>24</volume>
				<fpage>3836</fpage>
				<lpage>3845</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">146152</pubid>
						<pubid idtype="pmpid" link="fulltext">8871566</pubid>
						<pubid idtype="doi">10.1093/nar/24.19.3836</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B75">
				<title>
					<p>Blocks WWW Server</p>
				</title>
				<url>http://blocks.fhcrc.org</url>
			</bibl>
			<bibl id="B76">
				<title>
					<p>Splice site prediction in <it>Arabidopsis thaliana </it>pre-mRNA by combining local and global sequence information</p>
				</title>
				<aug>
					<au>
						<snm>Hebsgaard</snm>
						<fnm>SM</fnm>
					</au>
					<au>
						<snm>Korning</snm>
						<fnm>PG</fnm>
					</au>
					<au>
						<snm>Tolstrup</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Engelbrecht</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Rouze</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Brunak</snm>
						<fnm>S</fnm>
					</au>
				</aug>
				<source>Nucleic Acids Res</source>
				<pubdate>1996</pubdate>
				<volume>24</volume>
				<fpage>3439</fpage>
				<lpage>3452</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">146109</pubid>
						<pubid idtype="pmpid" link="fulltext">8811101</pubid>
						<pubid idtype="doi">10.1093/nar/24.17.3439</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B77">
				<title>
					<p>NetGene2 Server</p>
				</title>
				<url>http://www.cbs.dtu.dk/services/NetGene2</url>
			</bibl>
			<bibl id="B78">
				<title>
					<p>Normalization and subtraction: two approaches to facilitate gene discovery</p>
				</title>
				<aug>
					<au>
						<snm>Bonaldo</snm>
						<fnm>MF</fnm>
					</au>
					<au>
						<snm>Lennon</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Soares</snm>
						<fnm>MB</fnm>
					</au>
				</aug>
				<source>Genome Res</source>
				<pubdate>1996</pubdate>
				<volume>6</volume>
				<fpage>791</fpage>
				<lpage>806</lpage>
				<xrefbib>
					<pubid idtype="pmpid">8889548</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B79">
				<title>
					<p>Identification of gene expression profile of dorsal root ganglion in the rat peripheral axotomy model of neuropathic pain</p>
				</title>
				<aug>
					<au>
						<snm>Xiao</snm>
						<fnm>HS</fnm>
					</au>
					<au>
						<snm>Huang</snm>
						<fnm>QH</fnm>
					</au>
					<au>
						<snm>Zhang</snm>
						<fnm>FX</fnm>
					</au>
					<au>
						<snm>Bao</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Lu</snm>
						<fnm>YJ</fnm>
					</au>
					<au>
						<snm>Guo</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Yang</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Huang</snm>
						<fnm>WJ</fnm>
					</au>
					<au>
						<snm>Fu</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Xu</snm>
						<fnm>SH</fnm>
					</au>
					<etal/>
				</aug>
				<source>Proc Natl Acad Sci USA</source>
				<pubdate>2002</pubdate>
				<volume>99</volume>
				<fpage>8360</fpage>
				<lpage>8366</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">123072</pubid>
						<pubid idtype="pmpid" link="fulltext">12060780</pubid>
						<pubid idtype="doi">10.1073/pnas.122231899</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B80">
				<title>
					<p>Meat Animal Research Center</p>
				</title>
				<url>http://www.marc.usda.gov</url>
			</bibl>
			<bibl id="B81">
				<title>
					<p>Samuel Roberts Noble Foundation</p>
				</title>
				<url>http://www.noble.org</url>
			</bibl>
			<bibl id="B82">
				<title>
					<p>ATCC:The global bioresource center</p>
				</title>
				<url>http://www.atcc.org</url>
			</bibl>
			<bibl id="B83">
				<title>
					<p>Arizona Genomics Institute</p>
				</title>
				<url>http://www.genome.arizona.edu/orders</url>
			</bibl>
			<bibl id="B84">
				<title>
					<p>CUGI: Clemson University Genomics Institute</p>
				</title>
				<url>http://www.genome.clemson.edu</url>
			</bibl>
			<bibl id="B85">
				<title>
					<p>Analysis and functional annotation of an expressed sequence tag collection for tropical crop sugarcane</p>
				</title>
				<aug>
					<au>
						<snm>Vettore</snm>
						<fnm>AL</fnm>
					</au>
					<au>
						<snm>da Silva</snm>
						<fnm>FR</fnm>
					</au>
					<au>
						<snm>Kemper</snm>
						<fnm>EL</fnm>
					</au>
					<au>
						<snm>Souza</snm>
						<fnm>GM</fnm>
					</au>
					<au>
						<snm>da Silva</snm>
						<fnm>AM</fnm>
					</au>
					<au>
						<snm>Ferro</snm>
						<fnm>MI</fnm>
					</au>
					<au>
						<snm>Henrique-Silva</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Giglioti</snm>
						<fnm>EA</fnm>
					</au>
					<au>
						<snm>Lemos</snm>
						<fnm>MV</fnm>
					</au>
					<au>
						<snm>Coutinho</snm>
						<fnm>LL</fnm>
					</au>
					<etal/>
				</aug>
				<source>Genome Res</source>
				<pubdate>2003</pubdate>
				<volume>13</volume>
				<fpage>2725</fpage>
				<lpage>2735</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">403815</pubid>
						<pubid idtype="pmpid" link="fulltext">14613979</pubid>
						<pubid idtype="doi">10.1101/gr.1532103</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B86">
				<title>
					<p>Laboratory for Genomics and Bioinformatics</p>
				</title>
				<url>http://www.fungen.org</url>
			</bibl>
			<bibl id="B87">
				<title>
					<p>Rice Genome Research Program</p>
				</title>
				<url>http://rgp.dna.affrc.go.jp</url>
			</bibl>
			<bibl id="B88">
				<title>
					<p>Agriculture Research Service</p>
				</title>
				<url>http://www.ars.usda.gov</url>
			</bibl>
			<bibl id="B89">
				<title>
					<p>cerealsDB.uk.net</p>
				</title>
				<url>http://www.cerealsdb.uk.net</url>
			</bibl>
		</refgrp>
	</bm>
</art>
