Free Full Text Journal Articles: Genetics and Proteomics -- Neurotransmitter.net

Recent Articles in Nucleic Acids Research

Kikin O, Zappala Z, D'Antonio L, Bagga PS
GRSDB2 and GRS_UTRdb: databases of quadruplex forming G-rich sequences in pre-mRNAs and mRNAs.
Nucleic Acids Res. 2007 Nov 27; .
G-quadruplex motifs in the RNA play significant roles in key cellular processes and human disease. While sequences capable of forming G-quadruplexes in the pre-mRNA are involved in regulation of polyadenylation and splicing events in mammalian transcripts, the G-quadruplex motifs in the UTRs may help regulate mRNA expression. GRSDB2 is a second-generation database containing information on the composition and distribution of putative Quadruplex-forming G-Rich Sequences (QGRS) mapped in approximately 29 000 eukaryotic pre-mRNA sequences, many of which are alternatively processed. The data stored in the GRSDB2 is based on computational analysis of NCBI Entrez Gene entries with the help of an improved version of the QGRS Mapper program. The database allows complex queries with a wide variety of parameters, including Gene Ontology terms. The data is displayed in a variety of formats with several additional computational capabilities. We have also developed a new database, GRS_UTRdb, containing information on the composition and distribution patterns of putative QGRS in the 5'- and 3'-UTRs of eukaryotic mRNA sequences. The goal of these experiments has been to build freely accessible resources for exploring the role of G-quadruplex structure in regulation of gene expression at post-transcriptional level. The databases can be accessed at the G-Quadruplex Resource Site at: http://bioinformatics.ramapo.edu/GQRS/. [Abstract/Link to Full Text]

Rossignol T, Lechat P, Cuomo C, Zeng Q, Moszer I, d'Enfert C
CandidaDB: a multi-genome database for Candida species and related Saccharomycotina.
Nucleic Acids Res. 2007 Nov 26;
CandidaDB (http://genodb.pasteur.fr/CandidaDB) was established in 2002 to provide the first genomic database for the human fungal pathogen Candida albicans. The availability of an increasing number of fully or partially completed genome sequences of related fungal species has opened the path for comparative genomics and prompted us to migrate CandidaDB into a multi-genome database. The new version of CandidaDB houses the latest versions of the genomes of C. albicans strains SC5314 and WO-1 along with six genome sequences from species closely related to C. albicans that all belong to the CTG clade of Saccharomycotina-Candida tropicalis, Candida (Clavispora) lusitaniae, Candida (Pichia) guillermondii, Lodderomyces elongisporus, Debaryomyces hansenii, Pichia stipitis-and the reference Saccharomyces cerevisiae genome. CandidaDB includes sequences coding for 54 170 proteins with annotations collected from other databases, enriched with illustrations of structural features and functional domains and data of comparative analyses. In order to take advantage of the integration of multiple genomes in a unique database, new tools using pre-calculated or user-defined comparisons have been implemented that allow rapid access to comparative analysis at the genomic scale. [Abstract/Link to Full Text]

Cochrane G, Akhtar R, Aldebert P, Althorpe N, Baldwin A, Bates K, Bhattacharyya S, Bonfield J, Bower L, Browne P, Castro M, Cox T, Demiralp F, Eberhardt R, Faruque N, Hoad G, Jang M, Kulikova T, Labarga A, Leinonen R, Leonard S, Lin Q, Lopez R, Lorenc D, McWilliam H, Mukherjee G, Nardone F, Plaister S, Robinson S, Sobhany S, Vaughan R, Wu D, Zhu W, Apweiler R, Hubbard T, Birney E
Priorities for nucleotide trace, sequence and annotation data capture at the Ensembl Trace Archive and the EMBL Nucleotide Sequence Database.
Nucleic Acids Res. 2007 Nov 26;
The Ensembl Trace Archive (http://trace.ensembl.org/) and the EMBL Nucleotide Sequence Database (http://www.ebi.ac.uk/embl/), known together as the European Nucleotide Archive, continue to see growth in data volume and diversity. Selected major developments of 2007 are presented briefly, along with data submission and retrieval information. In the face of increasing requirements for nucleotide trace, sequence and annotation data archiving, data capture priority decisions have been taken at the European Nucleotide Archive. Priorities are discussed in terms of how reliably information can be captured, the long-term benefits of its capture and the ease with which it can be captured. [Abstract/Link to Full Text]

Sindelka R, Jon�k J, Hands R, Bustin SA, Kubista M
Intracellular expression profiles measured by real-time PCR tomography in the Xenopus laevis oocyte.
Nucleic Acids Res. 2007 Nov 26;
Real-time PCR tomography is a novel, quantitative method for measuring localized RNA expression profiles within single cells. We demonstrate its usefulness by dissecting an oocyte from Xenopus laevis into slices along its animal-vegetal axis, extracting its RNA and measuring the levels of 18 selected mRNAs by real-time RT-PCR. This identified two classes of mRNA, one preferentially located towards the animal, the other towards the vegetal pole. mRNAs within each group show comparable intracellular gradients, suggesting they are produced by similar mechanisms. The polarization is substantial, though not extreme, with around 5% of vegetal gene mRNA molecules detected at the animal pole, and around 50% of the molecules in the far most vegetal section. Most animal pole mRNAs were found in the second section from the animal pole and in the central section, which is where the nucleus is located. mRNA expression profiles did not change following in vitro fertilization and we conclude that the cortical rotation that follows fertilization has no detectable effect on intracellular mRNA gradients. [Abstract/Link to Full Text]

Farge G, Holmlund T, Khvorostova J, Rofougaran R, Hofer A, Falkenberg M
The N-terminal domain of TWINKLE contributes to single-stranded DNA binding and DNA helicase activities.
Nucleic Acids Res. 2007 Nov 26;
The TWINKLE protein is a hexameric DNA helicase required for replication of mitochondrial DNA. TWINKLE displays striking sequence similarity to the bacteriophage T7 gene 4 protein (gp4), which is a bi-functional primase-helicase required at the phage DNA replication fork. The N-terminal domain of human TWINKLE contains some of the characteristic sequence motifs found in the N-terminal primase domain of the T7 gp4, but other important motifs are missing. TWINKLE is not an active primase in vitro and the functional role of the N-terminal region has remained elusive. In this report, we demonstrate that the N-terminal part of TWINKLE is required for efficient binding to single-stranded DNA. Truncations of this region reduce DNA helicase activity and mitochondrial DNA replisome processivity. We also find that the gp4 and TWINKLE are functionally distinct. In contrast to the phage protein, TWINKLE binds to double-stranded DNA. Moreover, TWINKLE forms stable hexamers even in the absence of Mg(2+) or NTPs, which suggests that an accessory protein, a helicase loader, is needed for loading of TWINKLE onto the circular mtDNA genome. [Abstract/Link to Full Text]

Rodgers ME, Schleif R
DNA tape measurements of AraC.
Nucleic Acids Res. 2007 Nov 26;
A new method for measuring distances between points in the AraC-DNA complex was developed and applied. It utilizes variable lengths of single-stranded DNA that connect double-stranded regions containing the two half-site binding sequences of AraC. These distances plus the protein interdomain linker distances are compatible with two classes of structure for the dimeric AraC gene regulatory protein. In one class, the N-terminal regulatory arm of one dimerization domain is capable of interacting with the DNA-binding domain on the same polypeptide chain for a cis interaction. In the other class, the possible arm-DNA-binding domain interaction is trans, where it adds to the dimerization interface. [Abstract/Link to Full Text]

Zemla A, Geisbrecht B, Smith J, Lam M, Kirkpatrick B, Wagner M, Slezak T, Zhou CE
STRALCP structure alignment-based clustering of proteins.
Nucleic Acids Res. 2007 Nov 26;
Protein structural annotation and classification is an important and challenging problem in bioinformatics. Research towards analysis of sequence-structure correspondences is critical for better understanding of a protein's structure, function, and its interaction with other molecules. Clustering of protein domains based on their structural similarities provides valuable information for protein classification schemes. In this article, we attempt to determine whether structure information alone is sufficient to adequately classify protein structures. We present an algorithm that identifies regions of structural similarity within a given set of protein structures, and uses those regions for clustering. In our approach, called STRALCP (STRucture ALignment-based Clustering of Proteins), we generate detailed information about global and local similarities between pairs of protein structures, identify fragments (spans) that are structurally conserved among proteins, and use these spans to group the structures accordingly. We also provide a web server at http://as2ts.llnl.gov/AS2TS/STRALCP/ for selecting protein structures, calculating structurally conserved regions and performing automated clustering. [Abstract/Link to Full Text]

Dalal S, Chikova A, Jaeger J, Sweasy JB
The Leu22Pro tumor-associated variant of DNA polymerase beta is dRP lyase deficient.
Nucleic Acids Res. 2007 Nov 26;
Approximately 30% of human tumors characterized to date express DNA polymerase beta (pol beta) variant proteins. Two of the polymerase beta cancer-associated variants are sequence-specific mutators, and one of them binds to DNA but has no polymerase activity. The Leu22Pro (L22P) DNA polymerase beta variant was identified in a gastric carcinoma. Leu22 resides within the 8 kDa amino terminal domain of DNA polymerase beta, which exhibits dRP lyase activity. This domain catalyzes the removal of deoxyribose phosphate during short patch base excision repair. We show that this cancer-associated variant has very little dRP lyase activity but retains its polymerase activity. Although residue 22 has no direct contact with the DNA, we report here that the L22P variant has reduced DNA-binding affinity. The L22P variant protein is deficient in base excision repair. Molecular dynamics calculations suggest that alteration of Leu22 to Pro changes the local packing, the loop connecting helices 1 and 2 and the overall juxtaposition of the helices within the N-terminal domain. This in turn affects the shape of the binding pocket that is required for efficient dRP lyase catalysis. [Abstract/Link to Full Text]

Valgardsdottir R, Chiodi I, Giordano M, Rossi A, Bazzini S, Ghigna C, Riva S, Biamonti G
Transcription of Satellite III non-coding RNAs is a general stress response in human cells.
Nucleic Acids Res. 2007 Dec 11;
In heat-shocked human cells, heat shock factor 1 activates transcription of tandem arrays of repetitive Satellite III (SatIII) DNA in pericentromeric heterochromatin. Satellite III RNAs remain associated with sites of transcription in nuclear stress bodies (nSBs). Here we use real-time RT-PCR to study the expression of these genomic regions. Transcription is highly asymmetrical and most of the transcripts contain the G-rich strand of the repeat. A low level of G-rich RNAs is detectable in unstressed cells and a 10(4)-fold induction occurs after heat shock. G-rich RNAs are induced by a wide range of stress treatments including heavy metals, UV-C, oxidative and hyper-osmotic stress. Differences exist among stressing agents both for the kinetics and the extent of induction (>100- to 80.000-fold). In all cases, G-rich transcripts are associated with nSBs. On the contrary, C-rich transcripts are almost undetectable in unstressed cells and modestly increase after stress. Production of SatIII RNAs after hyper-osmotic stress depends on the Tonicity Element Binding Protein indicating that activation of the arrays is triggered by different transcription factors. This is the first example of a non-coding RNA whose transcription is controlled by different transcription factors under different growth conditions. [Abstract/Link to Full Text]

Witcher M, Pettersson F, Dup�r�-Richer D, Padovani A, Summers-Deluca L, Baldwin AS, Miller WH
Retinoic acid modulates chromatin to potentiate tumor necrosis factor alpha signaling on the DIF2 promoter.
Nucleic Acids Res. 2007 Nov 26;
Transcriptional activation by nuclear hormone receptors is well characterized, but their cooperation with other signaling pathways to activate transcription remains poorly understood. Tumor necrosis factor alpha (TNFalpha) and all-trans retinoic acid (RA) induce monocytic differentiation of acute promyelocytic leukemia (APL) cells in a synergistic manner. We used the promoter of DIF2, a gene involved in monocytic differentiation, to model the mechanism underlying the cooperative induction of target genes by RA and TNFalpha. We show a functional RA response element in the DIF2 promoter, which is constitutively bound by PML/RARalpha in APL cells. RA stimulates release of corepressors and recruitment of chromatin modifying proteins and additional transcription factors to the promoter, but these changes cause only a modest induction of DIF2 mRNA. Co-stimulation with RA plus TNFalpha facilitates binding of NF-kappaB to the promoter, which is crucial for full induction of transcription. Furthermore, RA plus TNFalpha greatly enhanced the level of RNA Pol II phosphorylation on the DIF2 promoter, via synergistic recruitment of TFIIH. We propose that RA mediates remodeling of chromatin to facilitate binding of transcription factors, which cooperate to enhance Pol II phosphorylation, providing a mechanism whereby nuclear receptors interact with other signaling pathways on the level of transcription. [Abstract/Link to Full Text]

Hines JC, Ray DS
Structure of discontinuities in kinetoplast DNA-associated minicircles during S phase in Crithidia fasciculata.
Nucleic Acids Res. 2007 Nov 26;
Kinetoplast DNA (kDNA) is a novel form of mitochondrial DNA consisting of thousands of interlocked minicircles and 20-30 maxicircles. The minicircles replicate free of the kDNA network but nicks and gaps in the newly synthesized strands remain at the time of reattachment to the kDNA network. We show here that the steady-state population of replicated, network-associated minicircles only becomes repaired to the point of having nicks with a 3'OH and 5'deoxyribonucleoside monophosphate during S phase. These nicks represent the origin/terminus of the strand and occur within the replication origins (oriA and oriB) located 180 degrees apart on the minicircle. Minicircles containing a new L strand have a single nick within either oriA or oriB but not in both origins in the same molecule. The discontinuously synthesized H strand contains single nicks within both oriA and oriB in the same molecule implying that discontinuities between the H-strand Okazaki fragments become repaired except for the fragments initiated within the two origins. Nicks in L and H strands at the origins persist throughout S phase and only become ligated as a prelude to network division. The failure to ligate these nicks until just prior to network division is not due to inappropriate termini for ligation. [Abstract/Link to Full Text]

Souli�re MF, Perreault JP, Bisaillon M
Magnesium-binding studies reveal fundamental differences between closely related RNA triphosphatases.
Nucleic Acids Res. 2007 Nov 26;
The Chlorella virus RNA triphosphatase (cvRTPase) is involved in the formation of the RNA cap structure found at the 5'-end of the viral mRNAs and requires magnesium ions to mediate its catalytic activity. To extend our studies on the role of metal ions in phosphohydrolysis, we have used a combination of fluorescence spectroscopy, circular dichroism, denaturation studies and thermodynamic analyses to monitor the binding of magnesium ions to the cvRTPase. Using these techniques, the thermodynamic forces responsible for the interaction of metal ions with an RNA triphosphatase were also evaluated for the first time. Our thermodynamic analyses indicate that the initial association of magnesium with the cvRTPase is dominated by a favorable entropic effect and is accompanied by the release of eight water molecules from the enzyme. Moreover, both fluorescence spectroscopy and circular dichroism assays indicated that minor conformational changes were occurring upon magnesium binding. Mutational studies were also performed and confirmed the importance of three specific glutamate residues located in the active site of the enzyme for the binding of magnesium ions. Finally, in contrast to the yeast RNA triphosphatase, we demonstrate that the binding of magnesium ions to the cvRTPase does not lead to the stabilization of the ground state binding of the RNA substrate. Based on the results of the present study, we hypothesize that the binding of magnesium ions induces local conformational perturbations in the active site residues that ultimately positions the lateral chains of critical amino acids involved in catalysis. Our results highlight fundamental differences in the role of magnesium ions in the phosphohydrolase reactions catalyzed by the cvRTPase and the closely related yeast RNA triphosphatase. [Abstract/Link to Full Text]

Halfon MS, Gallo SM, Bergman CM
REDfly 2.0: an integrated database of cis-regulatory modules and transcription factor binding sites in Drosophila.
Nucleic Acids Res. 2007 Nov 26;
The identification and study of the cis-regulatory elements that control gene expression are important areas of biological research, but few resources exist to facilitate large-scale bioinformatics studies of cis-regulation in metazoan species. Drosophila melanogaster, with its well-annotated genome, exceptional resources for comparative genomics and long history of experimental studies of transcriptional regulation, represents the ideal system for regulatory bioinformatics. We have merged two existing Drosophila resources, the REDfly database of cis-regulatory modules and the FlyReg database of transcription factor binding sites (TFBSs), into a single integrated database containing extensive annotation of empirically validated cis-regulatory modules and their constituent binding sites. With the enhanced functionality made possible through this integration of TFBS data into REDfly, together with additional improvements to the REDfly infrastructure, we have constructed a one-stop portal for Drosophila cis-regulatory data that will serve as a powerful resource for both computational and experimental studies of transcriptional regulation. REDfly is freely accessible at http://redfly.ccr.buffalo.edu. [Abstract/Link to Full Text]

Ara�zo-Bravo MJ, Sarai A
Indirect readout in drug-DNA recognition: role of sequence-dependent DNA conformation.
Nucleic Acids Res. 2007 Nov 26;
DNA-binding drugs have numerous applications in the engineered gene regulation. However, the drug-DNA recognition mechanism is poorly understood. Drugs can recognize specific DNA sequences not only through direct contacts but also indirectly through sequence-dependent conformation, in a similar manner to the indirect readout mechanism in protein-DNA recognition. We used a knowledge-based technique that takes advantage of known DNA structures to evaluate the conformational energies. We built a dataset of non-redundant free B-DNA crystal structures to calculate the distributions of adjacent base-step and base-pair conformations, and estimated the effective harmonic potentials of mean force (PMF). These PMFs were used to calculate the conformational energy of drug-DNA complexes, and the Z-score as a measure of the binding specificity. Comparing the Z-scores for drug-DNA complexes with those for free DNA structures with the same sequence, we observed that in several cases the Z-scores became more negative upon drug binding. Furthermore, the specificity is position-dependent within the drug-bound region of DNA. These results suggest that DNA conformation plays an important role in the drug-DNA recognition. The presented method provides a tool for the analysis of drug-DNA recognition and can facilitate the development of drugs for targeting a specific DNA sequence. [Abstract/Link to Full Text]

Finn RD, Tate J, Mistry J, Coggill PC, Sammut SJ, Hotz HR, Ceric G, Forslund K, Eddy SR, Sonnhammer EL, Bateman A
The Pfam protein families database.
Nucleic Acids Res. 2007 Nov 26;
Pfam is a comprehensive collection of protein domains and families, represented as multiple sequence alignments and as profile hidden Markov models. The current release of Pfam (22.0) contains 9318 protein families. Pfam is now based not only on the UniProtKB sequence database, but also on NCBI GenPept and on sequences from selected metagenomics projects. Pfam is available on the web from the consortium members using a new, consistent and improved website design in the UK (http://pfam.sanger.ac.uk/), the USA (http://pfam.janelia.org/) and Sweden (http://pfam.sbc.su.se/), as well as from mirror sites in France (http://pfam.jouy.inra.fr/) and South Korea (http://pfam.ccbb.re.kr/). [Abstract/Link to Full Text]

Rattei T, Tischler P, Arnold R, Hamberger F, Krebs J, Krumsiek J, Wachinger B, St�mpflen V, Mewes W
SIMAP structuring the network of protein similarities.
Nucleic Acids Res. 2007 Nov 23;
Protein sequences are the most important source of evolutionary and functional information for new proteins. In order to facilitate the computationally intensive tasks of sequence analysis, the Similarity Matrix of Proteins (SIMAP) database aims to provide a comprehensive and up-to-date dataset of the pre-calculated sequence similarity matrix and sequence-based features like InterPro domains for all proteins contained in the major public sequence databases. As of September 2007, SIMAP covers approximately 17 million proteins and more than 6 million non-redundant sequences and provides a complete annotation based on InterPro 16. Novel features of SIMAP include a new, portlet-based web portal providing multiple, structured views on retrieved proteins and integration of protein clusters and a unique search method for similar domain architectures. Access to SIMAP is freely provided for academic use through the web portal for individuals at http://mips.gsf.de/simap/and through Web Services for programmatic access at http://mips.gsf.de/webservices/services/SimapService2.0?wsdl. [Abstract/Link to Full Text]

Jones P, C�t� RG, Cho SY, Klie S, Martens L, Quinn AF, Thorneycroft D, Hermjakob H
PRIDE: new developments and new datasets.
Nucleic Acids Res. 2007 Nov 22;
The PRIDE (http://www.ebi.ac.uk/pride) database of protein and peptide identifications was previously described in the NAR Database Special Edition in 2006. Since this publication, the volume of public data in the PRIDE relational database has increased by more than an order of magnitude. Several significant public datasets have been added, including identifications and processed mass spectra generated by the HUPO Brain Proteome Project and the HUPO Liver Proteome Project. The PRIDE software development team has made several significant changes and additions to the user interface and tool set associated with PRIDE. The focus of these changes has been to facilitate the submission process and to improve the mechanisms by which PRIDE can be queried. The PRIDE team has developed a Microsoft Excel workbook that allows the required data to be collated in a series of relatively simple spreadsheets, with automatic generation of PRIDE XML at the end of the process. The ability to query PRIDE has been augmented by the addition of a BioMart interface allowing complex queries to be constructed. Collaboration with groups outside the EBI has been fruitful in extending PRIDE, including an approach to encode iTRAQ quantitative data in PRIDE XML. [Abstract/Link to Full Text]

Li D, Da L, Tang H, Li T, Zhao M
CpG methylation plays a vital role in determining tissue- and cell-specific expression of the human cell-death-inducing DFF45-like effector A gene through the regulation of Sp1/Sp3 binding.
Nucleic Acids Res. 2007 Nov 22;
Cell-death-inducing DFF45-like effector A (CIDE-A) belongs to a family of proapoptotic proteins, the expression of which is highly restricted in human tissues and cells. Here, the core region of the human CIDE-A promoter was characterized. Surprisingly, two Sp1/Sp3-binding sites, rather than tissue-specific transcription factors, were found to be required for the promoter activity. Although the ubiquitously expressed Sp1 and Sp3 were crucial, they alone could not adequately regulate the specific expression of CIDE-A. We found that the expression of CIDE-A was further regulated by CpG methylation of the promoter region. By performing bisulfite sequencing, we observed dense CpG methylation of the promoter region in tissues and cells with low or no expression of CIDE-A but not in tissues with high level of CIDE-A expression. In vitro methylation of this region showed significantly reduced transcriptional activity. Treatment of CIDE-A-negative cells with 5-aza-2'-deoxycytidine demethylated the CpG sites; this opened the closed chromatin conformation and markedly enhanced the binding affinity of Sp1/Sp3 to the promoter in vivo, thereby restoring CIDE-A expression. These data indicated that CpG methylation plays a crucial role in establishing and maintaining tissue- and cell-specific transcription of the CIDE-A gene through the regulation of Sp1/Sp3 binding. [Abstract/Link to Full Text]

Choi SW, Kano A, Maruyama A
Activation of DNA strand exchange by cationic comb-type copolymers: effect of cationic moieties of the copolymers.
Nucleic Acids Res. 2007 Nov 22;
We have previously reported that poly(l-lysine)-graft-dextran cationic comb-type copolymers accelerate strand exchange reaction between duplex DNA and its complementary single strand by >4 orders of magnitude, while stabilizing duplex. However, the stabilization of the duplex is considered principally unfavourable for the accelerating activity since the strand exchange reaction requires, at least, partial melting of the initial duplex. Here we report the effects of different cationic moieties of cationic comb-type copolymers on the accelerating activity. The copolymer having guanidino groups exhibited markedly higher accelerating effect on strand exchange reactions than that having primary amino groups. The high accelerating effect of the former is considered to be due to its lower stabilizing effect on duplex DNA, resulting from its increased affinity to single-stranded DNA. The difference in affinity was clearly demonstrated by a fluorescence correlation spectroscopy study; the interaction of the former with single-stranded DNA still remained high even at 1 M NaCl, while that of the latter completely disappeared. These results suggest that some modes of interactions, such as hydrogen bonding, other than electrostatic interactions between the copolymers having guanidino groups and DNAs may be involved in strand exchange activation. [Abstract/Link to Full Text]

Ivanyi-Nagy R, Lavergne JP, Gabus C, Ficheux D, Darlix JL
RNA chaperoning and intrinsic disorder in the core proteins of Flaviviridae.
Nucleic Acids Res. 2007 Nov 22;
RNA chaperone proteins are essential partners of RNA in living organisms and viruses. They are thought to assist in the correct folding and structural rearrangements of RNA molecules by resolving misfolded RNA species in an ATP-independent manner. RNA chaperoning is probably an entropy-driven process, mediated by the coupled binding and folding of intrinsically disordered protein regions and the kinetically trapped RNA. Previously, we have shown that the core protein of hepatitis C virus (HCV) is a potent RNA chaperone that can drive profound structural modifications of HCV RNA in vitro. We now examined the RNA chaperone activity and the disordered nature of core proteins from different Flaviviridae genera, namely that of HCV, GBV-B (GB virus B), WNV (West Nile virus) and BVDV (bovine viral diarrhoea virus). Despite low-sequence similarities, all four proteins demonstrated general nucleic acid annealing and RNA chaperone activities. Furthermore, heat resistance of core proteins, as well as far-UV circular dichroism spectroscopy suggested that a well-defined 3D protein structure is not necessary for core-induced RNA structural rearrangements. These data provide evidence that RNA chaperoning-possibly mediated by intrinsically disordered protein segments-is conserved in Flaviviridae core proteins. Thus, besides nucleocapsid formation, core proteins may function in RNA structural rearrangements taking place during virus replication. [Abstract/Link to Full Text]

Holbein S, Freimoser FM, Werner TP, Wengi A, Dichtl B
Cordycepin-hypersensitive growth links elevated polyphosphate levels to inhibition of poly(A) polymerase in Saccharomyces cerevisiae.
Nucleic Acids Res. 2007 Nov 22;
To identify genes involved in poly(A) metabolism, we screened the yeast gene deletion collection for growth defects in the presence of cordycepin (3'-deoxyadenosine), a precursor to the RNA chain terminating ATP analog cordycepin triphosphate. Deltapho80 and Deltapho85 strains, which have a constitutively active phosphate-response pathway, were identified as cordycepin hypersensitive. We show that inorganic polyphosphate (poly P) accumulated in these strains and that poly P is a potent inhibitor of poly(A) polymerase activity in vitro. Binding analyses of poly P and yeast Pap1p revealed an interaction with a k(D) in the low nanomolar range. Poly P also bound mammalian poly(A) polymerase, however, with a 10-fold higher k(D) compared to yeast Pap1p. Genetic tests with double mutants of Deltapho80 and other genes involved in phosphate homeostasis and poly P accumulation suggest that poly P contributed to cordycepin hypersensitivity. Synergistic inhibition of mRNA synthesis through poly P-mediated inhibition of Pap1p and through cordycepin-mediated RNA chain termination may thus account for hypersensitive growth of Deltapho80 and Deltapho85 strains in the presence of the chain terminator. Consistent with this, a mutation in the 3'-end formation component rna14 was synthetic lethal in combination with Deltapho80. Based on these observations, we suggest that binding of poly P to poly(A) polymerase negatively regulates its activity. [Abstract/Link to Full Text]

Mohanty BK, Kushner SR
Rho-independent transcription terminators inhibit RNase P processing of the secG leuU and metT tRNA polycistronic transcripts in Escherichia coli.
Nucleic Acids Res. 2007 Nov 22;
The widely accepted model for the processing of tRNAs in Escherichia coli involves essential initial cleavages by RNase E within polycistronic transcripts to generate pre-tRNAs that subsequently become substrates for RNase P. However, recently we identified two polycistronic tRNA transcripts whose endonucleolytic processing was solely dependent on RNase P. Here we show that the processing of the secG leuU and metT leuW glnU glnW metU glnV glnX polycistronic transcripts takes place through a different type of maturation pathway. Specifically, RNase P separates the tRNA units within each operon following the endonucleolytic removal of the distal Rho-independent transcription terminator, primarily by RNase E. Failure to remove the Rho-independent transcription terminator inhibits RNase P processing of both transcripts leading to a decrease in mature tRNA levels and dramatically increased levels of full-length transcripts in an RNase E deletion strain. Furthermore, we show for the first time that RNase G also removes the Rho-independent transcription terminator associated with the secG leuU operon. Our data also demonstrate that the Rne-1 protein retains significant activity on tRNA substrates at the non-permissive temperature. Taken together it is clear that there are multiple pathways involved in the maturation of tRNAs in E. coli. [Abstract/Link to Full Text]

Hernandez-Boussard T, Whirl-Carrillo M, Hebert JM, Gong L, Owen R, Gong M, Gor W, Liu F, Truong C, Whaley R, Woon M, Zhou T, Altman RB, Klein TE
The pharmacogenetics and pharmacogenomics knowledge base: accentuating the knowledge.
Nucleic Acids Res. 2007 Nov 21;
PharmGKB is a knowledge base that captures the relationships between drugs, diseases/phenotypes and genes involved in pharmacokinetics (PK) and pharmacodynamics (PD). This information includes literature annotations, primary data sets, PK and PD pathways, and expert-generated summaries of PK/PD relationships between drugs, diseases/phenotypes and genes. PharmGKB's website is designed to effectively disseminate knowledge to meet the needs of our users. PharmGKB currently has literature annotations documenting the relationship of over 500 drugs, 450 diseases and 600 variant genes. In order to meet the needs of whole genome studies, PharmGKB has added new functionalities, including browsing the variant display by chromosome and cytogenetic locations, allowing the user to view variants not located within a gene. We have developed new infrastructure for handling whole genome data, including increased methods for quality control and tools for comparison across other data sources, such as dbSNP, JSNP and HapMap data. PharmGKB has also added functionality to accept, store, display and query high throughput SNP array data. These changes allow us to capture more structured information on phenotypes for better cataloging and comparison of data. PharmGKB is available at www.pharmgkb.org. [Abstract/Link to Full Text]

Tremblay S, Wagner JR
Dehydration, deamination and enzymatic repair of cytosine glycols from oxidized poly(dG-dC) and poly(dI-dC).
Nucleic Acids Res. 2007 Nov 21;
Cytosine glycols (5,6-dihydroxy-5,6-dihydrocytosine) are initial products of cytosine oxidation. Because these products are not stable, virtually all biological studies have focused on the stable oxidation products of cytosine, including 5-hydroxycytosine, uracil glycols and 5-hydroxyuracil. Previously, we reported that the lifetime of cytosine glycols was greatly enhanced in double-stranded DNA, thus implicating these products in DNA repair and mutagenesis. In the present work, cytosine and uracil glycols were generated in double-stranded alternating co-polymers by oxidation with KMnO(4). The half-life of cytosine glycols in poly(dG-dC) was 6.5 h giving a ratio of dehydration to deamination of 5:1. At high substrate concentrations, the excision of cytosine glycols from poly(dG-dC) by purified endonuclease III was comparable to that of uracil glycols, whereas the excision of these substrates was 5-fold greater than that of 5-hydroxycytosine. Kinetic studies revealed that the V(max) was several fold higher for the excision of cytosine glycols compared to 5-hydroxycytosine. In contrast to cytosine glycols, uracil glycols did not undergo detectable dehydration to 5-hydroxyuracil. Replacing poly(dG-dC) for poly(dI-dC) gave similar results with respect to the lifetime and excision of cytosine glycols. This work demonstrates the formation of cytosine glycols in DNA and their removal by base excision repair. [Abstract/Link to Full Text]

Hatch K, Danilowicz C, Coljee V, Prentiss M
Measurement of the salt-dependent stabilization of partially open DNA by Escherichia coli SSB protein.
Nucleic Acids Res. 2007 Nov 21;
The rezipping force of two complementary DNA strands under tension has been measured in the presence of Escherichia coli single-stranded-binding proteins under salt conditions ranging from 10- to 400 mM NaCl. The effectiveness of the binding protein in preventing rezipping is strongly dependent on salt concentration and compared with the salt dependence in the absence of the protein. At concentrations less than 50 mM NaCl, the protein prevents complete rezipping of lambda-phage on the 2-s timescale of the experiment, when the ssDNA is under tensions as low as 3.5 +/- 1 pN. For salt concentrations greater than 200 mM NaCl, the protein inhibits rezipping but cannot block rezipping when the tension is reduced below 6 +/- 1.8 pN. This change in effectiveness as a function of salt concentration may correspond to salt-dependent changes in binding modes that were previously observed in bulk assays. [Abstract/Link to Full Text]

Nord D, Sj�berg BM
Unconventional GIY-YIG homing endonuclease encoded in group I introns in closely related strains of the Bacillus cereus group.
Nucleic Acids Res. 2007 Nov 21;
Several group I introns have been previously found in strains of the Bacillus cereus group at three different insertion sites in the nrdE gene of the essential nrdIEF operon coding for ribonucleotide reductase. Here, we identify an uncharacterized group IA intron in the nrdF gene in 12 strains of the B. cereus group and show that the pre-mRNA is efficiently spliced. The Bacillus thuringiensis ssp. pakistani nrdF intron encodes a homing endonuclease, denoted I-BthII, with an unconventional GIY-(X)(8)-YIG motif that cleaves an intronless nrdF gene 7 nt upstream of the intron insertion site, producing 2-nt 3' extensions. We also found four additional occurrences of two of the previously reported group I introns in the nrdE gene of 25 sequenced B. thuringiensis and one B. cereus strains, and one non-annotated group I intron at a fourth nrdE insertion site in the B. thuringiensis ssp. Al Hakam sequenced genome. Two strains contain introns in both the nrdE and the nrdF genes. Phylogenetic studies of the nrdIEF operon from 39 strains of the B. cereus group suggest several events of horizontal gene transfer for two of the introns found in this operon. [Abstract/Link to Full Text]

Yeats C, Lees J, Reid A, Kellam P, Martin N, Liu X, Orengo C
Gene3D: comprehensive structural and functional annotation of genomes.
Nucleic Acids Res. 2007 Nov 21;
Gene3D provides comprehensive structural and functional annotation of most available protein sequences, including the UniProt, RefSeq and Integr8 resources. The main structural annotation is generated through scanning these sequences against the CATH structural domain database profile-HMM library. CATH is a database of manually derived PDB-based structural domains, placed within a hierarchy reflecting topology, homology and conservation and is able to infer more ancient and divergent homology relationships than sequence-based approaches. This data is supplemented with Pfam-A, other non-domain structural predictions (i.e. coiled coils) and experimental data from UniProt. In order to enhance the investigations possible with this data, we have also incorporated a variety of protein annotation resources, including protein-protein interaction data, GO functional assignments, KEGG pathways, FUNCAT functional descriptions and links to microarray expression data. All of this data can be accessed through a newly re-designed website that has a focus on flexibility and clarity, with searches that can be restricted to a single genome or across the entire sequence database. Currently Gene3D contains over 3.5 million domain assignments for nearly 5 million proteins including 527 completed genomes. This is available at: http://gene3d.biochem.ucl.ac.uk/ [Abstract/Link to Full Text]

Wardle J, Burgers PM, Cann IK, Darley K, Heslop P, Johansson E, Lin LJ, McGlynn P, Sanvoisin J, Stith CM, Connolly BA
Uracil recognition by replicative DNA polymerases is limited to the archaea, not occurring with bacteria and eukarya.
Nucleic Acids Res. 2007 Nov 21;
Family B DNA polymerases from archaea such as Pyrococcus furiosus, which live at temperatures approximately 100 degrees C, specifically recognize uracil in DNA templates and stall replication in response to this base. Here it is demonstrated that interaction with uracil is not restricted to hyperthermophilic archaea and that the polymerase from mesophilic Methanosarcina acetivorans shows identical behaviour. The family B DNA polymerases replicate the genomes of archaea, one of the three fundamental domains of life. This publication further shows that the DNA replicating polymerases from the other two domains, bacteria (polymerase III) and eukaryotes (polymerases delta and epsilon for nuclear DNA and polymerase gamma for mitochondrial) are also unable to recognize uracil. Uracil occurs in DNA as a result of deamination of cytosine, either in G:C base-pairs or, more rapidly, in single stranded regions produced, for example, during replication. The resulting G:U mis-pairs/single stranded uracils are promutagenic and, unless repaired, give rise to G:C to A:T transitions in 50% of the progeny. The confinement of uracil recognition to polymerases of the archaeal domain is discussed in terms of the DNA repair pathways necessary for the elimination of uracil. [Abstract/Link to Full Text]

Kobayashi Y, Matsuo M, Sakamoto K, Wakasugi T, Yamada K, Obokata J
Two RNA editing sites with cis-acting elements of moderate sequence identity are recognized by an identical site-recognition protein in tobacco chloroplasts.
Nucleic Acids Res. 2007 Nov 21;
The chloroplast genome of higher plants contains 20-40 C-to-U RNA editing sites, whose number and locations are diversified among plant species. Biochemical analyses using in vitro RNA editing systems with chloroplast extracts have suggested that there is one-to-one recognition between proteinous site recognition factors and their respective RNA editing sites, but their rigidness and generality are still unsettled. In this study, we addressed this question with the aid of an in vitro RNA editing system from tobacco chloroplast extracts and with UV-crosslinking experiments. We found that the ndhB-9 and ndhF-1 editing sites of tobacco chloroplast transcripts are both bound by the site-specific trans-acting factors of 95 kDa. Cross-competition experiments between ndhB-9 and ndhF-1 RNAs demonstrated that the 95 kDa proteins specifically binding to the ndhB-9 and ndhF-1 sites are the identical protein. The binding regions of the 95 kDa protein on the ndhB-9 and ndhF-1 transcripts showed 60% identity in nucleotide sequence. This is the first biochemical demonstration that a site recognition factor of chloroplast RNA editing recognizes plural sites. On the basis of this finding, we discuss how plant organellar RNA editing sites have diverged during evolution. [Abstract/Link to Full Text]

Lechat P, Hummel L, Rousseau S, Moszer I
GenoList: an integrated environment for comparative analysis of microbial genomes.
Nucleic Acids Res. 2007 Nov 21;
The multitude of bacterial genome sequences being determined has generated new requirements regarding the development of databases and graphical interfaces: these are needed to organize and retrieve biological information from the comparison of large sets of genomes. GenoList (http://genolist.pasteur.fr/GenoList) is an integrated environment dedicated to querying and analyzing genome data from bacterial species. GenoList inherits from the SubtiList database and web server, the reference data resource for the Bacillus subtilis genome. The data model was extended to hold information about relationships between genomes (e.g. protein families). The web user interface was designed to primarily take into account biologists' needs and modes of operation. Along with standard query and browsing capabilities, comparative genomics facilities are available, including subtractive proteome analysis. One key feature is the integration of the many tools accessible in the environment. As an example, it is straightforward to identify the genes that are specific to a group of bacteria, export them as a tab-separated list, get their protein sequences and run a multiple alignment on a subset of these sequences. [Abstract/Link to Full Text]

Recent Articles in Genome Research

Dennis JH, Fan HY, Reynolds SM, Yuan G, Meldrim JC, Richter DJ, Peterson DG, Rando OJ, Noble WS, Kingston RE
Independent and complementary methods for large-scale structural analysis of mammalian chromatin.
Genome Res. 2007 Jun;17(6):928-39.
The fundamental building block of chromatin, the nucleosome, occupies 150 bp of DNA in a spaced arrangement that is a primary determinant in regulation of the genome. The nucleosomal organization of some regions of the human genome has been described, but mapping of these regions has been limited to a few kilobases. We have explored two independent and complementary methods for the high-throughput analysis of mammalian chromatin structure. Through adaptations to a protocol used to map yeast chromatin structure, we determined sites of nucleosomal protection over large regions of the mammalian genome using a tiling microarray. By modifying classical primer extension methods, we localized specific internucleosomally cleaved mammalian genomic sequences using a capillary electrophoresis sequencer in a manner that allows high-throughput nucleotide-resolution characterization of nucleosome protection patterns. We developed algorithms for the automated and unbiased analysis of the resulting data, a necessary step toward large-scale analysis. We validated these assays using the known positions of nucleosomes on the mouse mammary tumor virus LTR, and additionally, we characterized the previously unreported chromatin structure of the LCMT2 gene. These results demonstrate the effectiveness of the combined methods for reliable analysis of mammalian chromatin structure in a high-throughput manner. [Abstract/Link to Full Text]

Thurman RE, Day N, Noble WS, Stamatoyannopoulos JA
Identification of higher-order functional domains in the human ENCODE regions.
Genome Res. 2007 Jun;17(6):917-27.
It has long been posited that human and other large genomes are organized into higher-order (i.e., greater than gene-sized) functional domains. We hypothesized that diverse experimental data types generated by The ENCODE Project Consortium could be combined to delineate active and quiescent or repressed functional domains and thereby illuminate the higher-order functional architecture of the genome. To address this, we coupled wavelet analysis with hidden Markov models for unbiased discovery of "domain-level" behavior in high-resolution functional genomic data, including activating and repressive histone modifications, RNA output, and DNA replication timing. We find that higher-order patterns in these data types are largely concordant and may be analyzed collectively in the context of HeLa cells to delineate 53 active and 62 repressed functional domains within the ENCODE regions. Active domains comprise approximately 44% of the ENCODE regions but contain approximately 75%-80% of annotated genes, transcripts, and CpG islands. Repressed domains are enriched in certain classes of repetitive elements and, surprisingly, in evolutionarily conserved nonexonic sequences. The functional domain structure of the ENCODE regions appears to be largely stable across different cell types. Taken together, our results suggest that higher-order functional domains represent a fundamental organizing principle of human genome architecture. [Abstract/Link to Full Text]

Bhinge AA, Kim J, Euskirchen GM, Snyder M, Iyer VR
Mapping the chromosomal targets of STAT1 by Sequence Tag Analysis of Genomic Enrichment (STAGE).
Genome Res. 2007 Jun;17(6):910-6.
Identifying the genome-wide binding sites of transcription factors is important in deciphering transcriptional regulatory networks. ChIP-chip (Chromatin immunoprecipitation combined with microarrays) has been widely used to map transcription factor binding sites in the human genome. However, whole genome ChIP-chip analysis is still technically challenging in vertebrates. We recently developed STAGE as an unbiased method for identifying transcription factor binding sites in the genome. STAGE is conceptually based on SAGE, except that the input is ChIP-enriched DNA. In this study, we implemented an improved sequencing strategy and analysis methods and applied STAGE to map the genomic binding profile of the transcription factor STAT1 after interferon treatment. STAT1 is mainly responsible for mediating the cellular responses to interferons, such as cell proliferation, apoptosis, immune surveillance, and immune responses. We present novel algorithms for STAGE tag analysis to identify enriched loci with high specificity, as verified by quantitative ChIP. STAGE identified several previously unknown STAT1 target genes, many of which are involved in mediating the response to interferon-gamma signaling. STAGE is thus a viable method for identifying the chromosomal targets of transcription factors and generating meaningful biological hypotheses that further our understanding of transcriptional regulatory networks. [Abstract/Link to Full Text]

Euskirchen GM, Rozowsky JS, Wei CL, Lee WH, Zhang ZD, Hartman S, Emanuelsson O, Stolc V, Weissman S, Gerstein MB, Ruan Y, Snyder M
Mapping of transcription factor binding regions in mammalian cells by ChIP: comparison of array- and sequencing-based technologies.
Genome Res. 2007 Jun;17(6):898-909.
Recent progress in mapping transcription factor (TF) binding regions can largely be credited to chromatin immunoprecipitation (ChIP) technologies. We compared strategies for mapping TF binding regions in mammalian cells using two different ChIP schemes: ChIP with DNA microarray analysis (ChIP-chip) and ChIP with DNA sequencing (ChIP-PET). We first investigated parameters central to obtaining robust ChIP-chip data sets by analyzing STAT1 targets in the ENCODE regions of the human genome, and then compared ChIP-chip to ChIP-PET. We devised methods for scoring and comparing results among various tiling arrays and examined parameters such as DNA microarray format, oligonucleotide length, hybridization conditions, and the use of competitor Cot-1 DNA. The best performance was achieved with high-density oligonucleotide arrays, oligonucleotides >/=50 bases (b), the presence of competitor Cot-1 DNA and hybridizations conducted in microfluidics stations. When target identification was evaluated as a function of array number, 80%-86% of targets were identified with three or more arrays. Comparison of ChIP-chip with ChIP-PET revealed strong agreement for the highest ranked targets with less overlap for the low ranked targets. With advantages and disadvantages unique to each approach, we found that ChIP-chip and ChIP-PET are frequently complementary in their relative abilities to detect STAT1 targets for the lower ranked targets; each method detected validated targets that were missed by the other method. The most comprehensive list of STAT1 binding regions is obtained by merging results from ChIP-chip and ChIP-sequencing. Overall, this study provides information for robust identification, scoring, and validation of TF targets using ChIP-based technologies. [Abstract/Link to Full Text]

Karnani N, Taylor C, Malhotra A, Dutta A
Pan-S replication patterns and chromosomal domains defined by genome-tiling arrays of ENCODE genomic areas.
Genome Res. 2007 Jun;17(6):865-76.
In eukaryotes, accurate control of replication time is required for the efficient completion of S phase and maintenance of genome stability. We present a high-resolution genome-tiling array-based profile of replication timing for approximately 1% of the human genome studied by The ENCODE Project Consortium. Twenty percent of the investigated segments replicate asynchronously (pan-S). These areas are rich in genes and CpG islands, features they share with early-replicating loci. Interphase FISH showed that pan-S replication is a consequence of interallelic variation in replication time and is not an artifact derived from a specific cell cycle synchronization method or from aneuploidy. The interallelic variation in replication time is likely due to interallelic variation in chromatin environment, because while the early- or late-replicating areas were exclusively enriched in activating or repressing histone modifications, respectively, the pan-S areas had both types of histone modification. The replication profile of the chromosomes identified contiguous chromosomal segments of hundreds of kilobases separated by smaller segments where the replication time underwent an acute transition. Close examination of one such segment demonstrated that the delay of replication time was accompanied by a decrease in level of gene expression and appearance of repressive chromatin marks, suggesting that the transition segments are boundary elements separating chromosomal domains with different chromatin environments. [Abstract/Link to Full Text]

Washietl S, Pedersen JS, Korbel JO, Stocsits C, Gruber AR, Hackerm�ller J, Hertel J, Lindemeyer M, Reiche K, Tanzer A, Ucla C, Wyss C, Antonarakis SE, Denoeud F, Lagarde J, Drenkow J, Kapranov P, Gingeras TR, Guig� R, Snyder M, Gerstein MB, Reymond A, Hofacker IL, Stadler PF
Structured RNAs in the ENCODE selected regions of the human genome.
Genome Res. 2007 Jun;17(6):852-64.
Functional RNA structures play an important role both in the context of noncoding RNA transcripts as well as regulatory elements in mRNAs. Here we present a computational study to detect functional RNA structures within the ENCODE regions of the human genome. Since structural RNAs in general lack characteristic signals in primary sequence, comparative approaches evaluating evolutionary conservation of structures are most promising. We have used three recently introduced programs based on either phylogenetic-stochastic context-free grammar (EvoFold) or energy directed folding (RNAz and AlifoldZ), yielding several thousand candidate structures (corresponding to approximately 2.7% of the ENCODE regions). EvoFold has its highest sensitivity in highly conserved and relatively AU-rich regions, while RNAz favors slightly GC-rich regions, resulting in a relatively small overlap between methods. Comparison with the GENCODE annotation points to functional RNAs in all genomic contexts, with a slightly increased density in 3'-UTRs. While we estimate a significant false discovery rate of approximately 50%-70% many of the predictions can be further substantiated by additional criteria: 248 loci are predicted by both RNAz and EvoFold, and an additional 239 RNAz or EvoFold predictions are supported by the (more stringent) AlifoldZ algorithm. Five hundred seventy RNAz structure predictions fall into regions that show signs of selection pressure also on the sequence level (i.e., conserved elements). More than 700 predictions overlap with noncoding transcripts detected by oligonucleotide tiling arrays. One hundred seventy-five selected candidates were tested by RT-PCR in six tissues, and expression could be verified in 43 cases (24.6%). [Abstract/Link to Full Text]

Zheng D, Frankish A, Baertsch R, Kapranov P, Reymond A, Choo SW, Lu Y, Denoeud F, Antonarakis SE, Snyder M, Ruan Y, Wei CL, Gingeras TR, Guig� R, Harrow J, Gerstein MB
Pseudogenes in the ENCODE regions: consensus annotation, analysis of transcription, and evolution.
Genome Res. 2007 Jun;17(6):839-51.
Arising from either retrotransposition or genomic duplication of functional genes, pseudogenes are "genomic fossils" valuable for exploring the dynamics and evolution of genes and genomes. Pseudogene identification is an important problem in computational genomics, and is also critical for obtaining an accurate picture of a genome's structure and function. However, no consensus computational scheme for defining and detecting pseudogenes has been developed thus far. As part of the ENCyclopedia Of DNA Elements (ENCODE) project, we have compared several distinct pseudogene annotation strategies and found that different approaches and parameters often resulted in rather distinct sets of pseudogenes. We subsequently developed a consensus approach for annotating pseudogenes (derived from protein coding genes) in the ENCODE regions, resulting in 201 pseudogenes, two-thirds of which originated from retrotransposition. A survey of orthologs for these pseudogenes in 28 vertebrate genomes showed that a significant fraction ( approximately 80%) of the processed pseudogenes are primate-specific sequences, highlighting the increasing retrotransposition activity in primates. Analysis of sequence conservation and variation also demonstrated that most pseudogenes evolve neutrally, and processed pseudogenes appear to have lost their coding potential immediately or soon after their emergence. In order to explore the functional implication of pseudogene prevalence, we have extensively examined the transcriptional activity of the ENCODE pseudogenes. We performed systematic series of pseudogene-specific RACE analyses. These, together with complementary evidence derived from tiling microarrays and high throughput sequencing, demonstrated that at least a fifth of the 201 pseudogenes are transcribed in one or more cell lines or tissues. [Abstract/Link to Full Text]

Ruan Y, Ooi HS, Choo SW, Chiu KP, Zhao XD, Srinivasan KG, Yao F, Choo CY, Liu J, Ariyaratne P, Bin WG, Kuznetsov VA, Shahab A, Sung WK, Bourque G, Palanisamy N, Wei CL
Fusion transcripts and transcribed retrotransposed loci discovered through comprehensive transcriptome analysis using Paired-End diTags (PETs).
Genome Res. 2007 Jun;17(6):828-38.
Identification of unconventional functional features such as fusion transcripts is a challenging task in the effort to annotate all functional DNA elements in the human genome. Paired-End diTag (PET) analysis possesses a unique capability to accurately and efficiently characterize the two ends of DNA fragments, which may have either normal or unusual compositions. This unique nature of PET analysis makes it an ideal tool for uncovering unconventional features residing in the human genome. Using the PET approach for comprehensive transcriptome analysis, we were able to identify fusion transcripts derived from genome rearrangements and actively expressed retrotransposed pseudogenes, which would be difficult to capture by other means. Here, we demonstrate this unique capability through the analysis of 865,000 individual transcripts in two types of cancer cells. In addition to the characterization of a large number of differentially expressed alternative 5' and 3' transcript variants and novel transcriptional units, we identified 70 fusion transcript candidates in this study. One was validated as the product of a fusion gene between BCAS4 and BCAS3 resulting from an amplification followed by a translocation event between the two loci, chr20q13 and chr17q23. Through an examination of PETs that mapped to multiple genomic locations, we identified 4055 retrotransposed loci in the human genome, of which at least three were found to be transcriptionally active. The PET mapping strategy presented here promises to be a useful tool in annotating the human genome, especially aberrations in human cancer genomes. [Abstract/Link to Full Text]

Lin JM, Collins PJ, Trinklein ND, Fu Y, Xi H, Myers RM, Weng Z
Transcription factor binding and modified histones in human bidirectional promoters.
Genome Res. 2007 Jun;17(6):818-27.
Bidirectional promoters have received considerable attention because of their ability to regulate two downstream genes (divergent genes). They are also highly abundant, directing the transcription of approximately 11% of genes in the human genome. We categorized the presence of DNA sequence motifs, binding of transcription factors, and modified histones as overrepresented, shared, or underrepresented in bidirectional promoters with respect to unidirectional promoters. We found that a small set of motifs, including GABPA, MYC, E2F1, E2F4, NRF-1, CCAAT, YY1, and ACTACAnnTCC are overrepresented in bidirectional promoters, while the majority (73%) of known vertebrate motifs are underrepresented. We performed chromatin-immunoprecipitation (ChIP), followed by quantitative PCR for GABPA, on 118 regions in the human genome and showed that it binds to bidirectional promoters more frequently than unidirectional promoters, and its position-specific scoring matrix is highly predictive of binding. Signatures of active transcription, such as occupancy of RNA polymerase II and the modified histones H3K4me2, H3K4me3, and H3ac, are overrepresented in regions around bidirectional promoters, suggesting that a higher fraction of divergent genes are transcribed in a given cell than the fraction of other genes. Accordingly, analysis of whole-genome microarray data indicates that 68% of divergent genes are transcribed compared with 44% of all human genes. By combining the analysis of publicly available ENCODE data and a detailed study of GABPA, we survey bidirectional promoters with breadth and depth, leading to biological insights concerning their motif composition and bidirectional regulatory mode. [Abstract/Link to Full Text]

Jin VX, O'Geen H, Iyengar S, Green R, Farnham PJ
Identification of an OCT4 and SRY regulatory module using integrated computational and experimental genomics approaches.
Genome Res. 2007 Jun;17(6):807-17.
ChIP-chip studies have revealed that many in vivo binding sites have a weak match to the consensus sequence for the transcription factor being analyzed. Possible explanations for these observations include (1) the in vitro-derived consensus site does not represent the in vivo binding site and/or (2) the factor is recruited to a weak binding site via interaction with another protein. To address these possibilities, we developed an approach (ChIPMotifs) that incorporates a bootstrap resampling method to statistically infer the optimal cutoff threshold for a position weight matrix (PWM) of a motif identified from ChIP-chip data by ab initio motif discovery programs. Using OCT4 ChIP-chip data and the ChIPMotifs approach, we first developed a refined OCT4 PWM. We then used the refined PWM and a ChIPModules approach to identify transcription factors colocalizing with OCT4 in Ntera2 testicular embryonal carcinoma cells. We found that the consensus binding site for SRY, a transcription factor critical for testis development, colocalizes with the OCT4 PWM. To further characterize the relationship between OCT4 and SRY, we performed ChIP-chip experiments with human promoter microarrays, and found that 49% of the top approximately 1000 OCT4 target promoters were also bound by SRY. This analysis represents the first identification of SRY target promoters. Interestingly, we determined that promoters bound by OCT4 and SRY, but not those bound by SRY alone, were also bound by the transcriptional repressor KAP1. Our studies not only validate the ChIPMotifs and ChIPModules combinatorial approach but also identify a possible new regulatory partner of OCT4. [Abstract/Link to Full Text]

Xi H, Yu Y, Fu Y, Foley J, Halees A, Weng Z
Analysis of overrepresented motifs in human core promoters reveals dual regulatory roles of YY1.
Genome Res. 2007 Jun;17(6):798-806.
A set of 723 high-quality human core promoter sequences were compiled and analyzed for overrepresented motifs. Beside the two well-characterized core promoter motifs (TATA and Inr), several known motifs (YY1, Sp1, NRF-1, NRF-2, CAAT, and CREB) and one potentially new motif (motif8) were found. Interestingly, YY1 and motif8 mostly reside immediately downstream from the TSS. In particular, the YY1 motif occurs primarily in genes with 5'-UTRs shorter than 40 base pairs (bp) and its locations coincide with the translation start site. We verified that the YY1 motif is bound by YY1 in vitro. We then performed detailed analysis on YY1 chromatin immunoprecipitation data with a whole-genome human promoter microarray (ChIP-chip) and revealed that the thus identified promoters in HeLa cells were highly enriched with the YY1 motif. Moreover, the motif overlapped with the translation start sites on the plus strand of a group of genes, many with short 5'-UTRs, and with the transcription start sites on the minus strand of another distinct group of genes; together, the two groups of genes accounted for the majority of the YY1-bound promoters in the ChIP-chip data. Furthermore, the first group of genes was highly enriched in the functional categories of ribosomal proteins and nuclear-encoded mitochondria proteins. We suggest that the YY1 motif plays a dual role in both transcription and translation initiation of these genes. We also discuss the evolutionary advantages of housing a transcriptional element inside the transcript in terms of the migration of these genes in the human genome. [Abstract/Link to Full Text]

Zhang ZD, Paccanaro A, Fu Y, Weissman S, Weng Z, Chang J, Snyder M, Gerstein MB
Statistical analysis of the genomic distribution and correlation of regulatory elements in the ENCODE regions.
Genome Res. 2007 Jun;17(6):787-97.
The comprehensive inventory of functional elements in 44 human genomic regions carried out by the ENCODE Project Consortium enables for the first time a global analysis of the genomic distribution of transcriptional regulatory elements. In this study we developed an intuitive and yet powerful approach to analyze the distribution of regulatory elements found in many different ChIP-chip experiments on a 10 approximately 100-kb scale. First, we focus on the overall chromosomal distribution of regulatory elements in the ENCODE regions and show that it is highly nonuniform. We demonstrate, in fact, that regulatory elements are associated with the location of known genes. Further examination on a local, single-gene scale shows an enrichment of regulatory elements near both transcription start and end sites. Our results indicate that overall these elements are clustered into regulatory rich "islands" and poor "deserts." Next, we examine how consistent the nonuniform distribution is between different transcription factors. We perform on all the factors a multivariate analysis in the framework of a biplot, which enhances biological signals in the experiments. This groups transcription factors into sequence-specific and sequence-nonspecific clusters. Moreover, with experimental variation carefully controlled, detailed correlations show that the distribution of sites was generally reproducible for a specific factor between different laboratories and microarray platforms. Data sets associated with histone modifications have particularly strong correlations. Finally, we show how the correlations between factors change when only regulatory elements far from the transcription start sites are considered. [Abstract/Link to Full Text]

King DC, Taylor J, Zhang Y, Cheng Y, Lawson HA, Martin J, Chiaromonte F, Miller W, Hardison RC
Finding cis-regulatory elements using comparative genomics: some lessons from ENCODE data.
Genome Res. 2007 Jun;17(6):775-86.
Identification of functional genomic regions using interspecies comparison will be most effective when the full span of relationships between genomic function and evolutionary constraint are utilized. We find that sets of putative transcriptional regulatory sequences, defined by ENCODE experimental data, have a wide span of evolutionary histories, ranging from stringent constraint shown by deep phylogenetic comparisons to recent selection on lineage-specific elements. This diversity of evolutionary histories can be captured, at least in part, by the suite of available comparative genomics tools, especially after correction for regional differences in the neutral substitution rate. Putative transcriptional regulatory regions show alignability in different clades, and the genes associated with them are enriched for distinct functions. Some of the putative regulatory regions show evidence for recent selection, including a primate-specific, distal promoter that may play a novel role in regulation. [Abstract/Link to Full Text]

Margulies EH, Cooper GM, Asimenos G, Thomas DJ, Dewey CN, Siepel A, Birney E, Keefe D, Schwartz AS, Hou M, Taylor J, Nikolaev S, Montoya-Burgos JI, L�ytynoja A, Whelan S, Pardi F, Massingham T, Brown JB, Bickel P, Holmes I, Mullikin JC, Ureta-Vidal A, Paten B, Stone EA, Rosenbloom KR, Kent WJ, Bouffard GG, Guan X, Hansen NF, Idol JR, Maduro VV, Maskeri B, McDowell JC, Park M, Thomas PJ, Young AC, Blakesley RW, Muzny DM, Sodergren E, Wheeler DA, Worley KC, Jiang H, Weinstock GM, Gibbs RA, Graves T, Fulton R, Mardis ER, Wilson RK, Clamp M, Cuff J, Gnerre S, Jaffe DB, Chang JL, Lindblad-Toh K, Lander ES, Hinrichs A, Trumbower H, Clawson H, Zweig A, Kuhn RM, Barber G, Harte R, Karolchik D, Field MA, Moore RA, Matthewson CA, Schein JE, Marra MA, Antonarakis SE, Batzoglou S, Goldman N, Hardison R, Haussler D, Miller W, Pachter L, Green ED, Sidow A
Analyses of deep mammalian sequence alignments and constraint predictions for 1% of the human genome.
Genome Res. 2007 Jun;17(6):760-74.
A key component of the ongoing ENCODE project involves rigorous comparative sequence analyses for the initially targeted 1% of the human genome. Here, we present orthologous sequence generation, alignment, and evolutionary constraint analyses of 23 mammalian species for all ENCODE targets. Alignments were generated using four different methods; comparisons of these methods reveal large-scale consistency but substantial differences in terms of small genomic rearrangements, sensitivity (sequence coverage), and specificity (alignment accuracy). We describe the quantitative and qualitative trade-offs concomitant with alignment method choice and the levels of technical error that need to be accounted for in applications that require multisequence alignments. Using the generated alignments, we identified constrained regions using three different methods. While the different constraint-detecting methods are in general agreement, there are important discrepancies relating to both the underlying alignments and the specific algorithms. However, by integrating the results across the alignments and constraint-detecting methods, we produced constraint annotations that were found to be robust based on multiple independent measures. Analyses of these annotations illustrate that most classes of experimentally annotated functional elements are enriched for constrained sequences; however, large portions of each class (with the exception of protein-coding sequences) do not overlap constrained regions. The latter elements might not be under primary sequence constraint, might not be constrained across all mammals, or might have expendable molecular functions. Conversely, 40% of the constrained sequences do not overlap any of the functional elements that have been experimentally identified. Together, these findings demonstrate and quantify how many genomic functional elements await basic molecular characterization. [Abstract/Link to Full Text]

Denoeud F, Kapranov P, Ucla C, Frankish A, Castelo R, Drenkow J, Lagarde J, Alioto T, Manzano C, Chrast J, Dike S, Wyss C, Henrichsen CN, Holroyd N, Dickson MC, Taylor R, Hance Z, Foissac S, Myers RM, Rogers J, Hubbard T, Harrow J, Guig� R, Gingeras TR, Antonarakis SE, Reymond A
Prominent use of distal 5' transcription start sites and discovery of a large number of additional exons in ENCODE regions.
Genome Res. 2007 Jun;17(6):746-59.
This report presents systematic empirical annotation of transcript products from 399 annotated protein-coding loci across the 1% of the human genome targeted by the Encyclopedia of DNA elements (ENCODE) pilot project using a combination of 5' rapid amplification of cDNA ends (RACE) and high-density resolution tiling arrays. We identified previously unannotated and often tissue- or cell-line-specific transcribed fragments (RACEfrags), both 5' distal to the annotated 5' terminus and internal to the annotated gene bounds for the vast majority (81.5%) of the tested genes. Half of the distal RACEfrags span large segments of genomic sequences away from the main portion of the coding transcript and often overlap with the upstream-annotated gene(s). Notably, at least 20% of the resultant novel transcripts have changes in their open reading frames (ORFs), most of them fusing ORFs of adjacent transcripts. A significant fraction of distal RACEfrags show expression levels comparable to those of known exons of the same locus, suggesting that they are not part of very minority splice forms. These results have significant implications concerning (1) our current understanding of the architecture of protein-coding genes; (2) our views on locations of regulatory regions in the genome; and (3) the interpretation of sequence polymorphisms mapping to regions hitherto considered to be "noncoding," ultimately relating to the identification of disease-related sequence alterations. [Abstract/Link to Full Text]

Rozowsky JS, Newburger D, Sayward F, Wu J, Jordan G, Korbel JO, Nagalakshmi U, Yang J, Zheng D, Guig� R, Gingeras TR, Weissman S, Miller P, Snyder M, Gerstein MB
The DART classification of unannotated transcription within the ENCODE regions: associating transcription with known and novel loci.
Genome Res. 2007 Jun;17(6):732-45.
For the approximately 1% of the human genome in the ENCODE regions, only about half of the transcriptionally active regions (TARs) identified with tiling microarrays correspond to annotated exons. Here we categorize this large amount of "unannotated transcription." We use a number of disparate features to classify the 6988 novel TARs-array expression profiles across cell lines and conditions, sequence composition, phylogenetic profiles (presence/absence of syntenic conservation across 17 species), and locations relative to genes. In the classification, we first filter out TARs with unusual sequence composition and those likely resulting from cross-hybridization. We then associate some of those remaining with proximal exons having correlated expression profiles. Finally, we cluster unclassified TARs into putative novel loci, based on similar expression and phylogenetic profiles. To encapsulate our classification, we construct a Database of Active Regions and Tools (DART.gersteinlab.org). DART has special facilities for rapidly handling and comparing many sets of TARs and their heterogeneous features, synchronizing across builds, and interfacing with other resources. Overall, we find that approximately 14% of the novel TARs can be associated with known genes, while approximately 21% can be clustered into approximately 200 novel loci. We observe that TARs associated with genes are enriched in the potential to form structural RNAs and many novel TAR clusters are associated with nearby promoters. To benchmark our classification, we design a set of experiments for testing the connectivity of novel TARs. Overall, we find that 18 of the 46 connections tested validate by RT-PCR and four of five sequenced PCR products confirm connectivity unambiguously. [Abstract/Link to Full Text]

Trinklein ND, Kara�z U, Wu J, Halees A, Force Aldred S, Collins PJ, Zheng D, Zhang ZD, Gerstein MB, Snyder M, Myers RM, Weng Z
Integrated analysis of experimental data sets reveals many novel promoters in 1% of the human genome.
Genome Res. 2007 Jun;17(6):720-31.
The regulation of transcriptional initiation in the human genome is a critical component of global gene regulation, but a complete catalog of human promoters currently does not exist. In order to identify regulatory regions, we developed four computational methods to integrate 129 sets of ENCODE-wide chromatin immunoprecipitation data. They collectively predicted 1393 regions. Roughly 47% of the regions were unique to one method, as each method makes different assumptions about the data. Overall, predicted regions tend to localize to highly conserved, DNase I hypersensitive, and actively transcribed regions in the genome. Interestingly, a significant portion of the regions overlaps with annotated 3'-UTRs, suggesting that some of them might regulate anti-sense transcription. The majority of the predicted regions are >2 kb away from the 5'-ends of previously annotated human cDNAs and hence are novel. These novel regions may regulate unannotated transcripts or may represent new alternative transcription start sites of known genes. We tested 163 such regions for promoter activity in four cell lines using transient transfection assays, and 25% of them showed transcriptional activity above background in at least one cell line. We also performed 5'-RACE experiments on 62 novel regions, and 76% of the regions were associated with the 5'-ends of at least two RACE products. Our results suggest that there are at least 35% more functional promoters in the human genome than currently annotated. [Abstract/Link to Full Text]

Rada-Iglesias A, Enroth S, Ameur A, Koch CM, Clelland GK, Respuela-Alonso P, Wilcox S, Dovey OM, Ellis PD, Langford CF, Dunham I, Komorowski J, Wadelius C
Butyrate mediates decrease of histone acetylation centered on transcription start sites and down-regulation of associated genes.
Genome Res. 2007 Jun;17(6):708-19.
Butyrate is a histone deacetylase inhibitor (HDACi) with anti-neoplastic properties, which theoretically reactivates epigenetically silenced genes by increasing global histone acetylation. However, recent studies indicate that a similar number or even more genes are down-regulated than up-regulated by this drug. We treated hepatocarcinoma HepG2 cells with butyrate and characterized the levels of acetylation at DNA-bound histones H3 and H4 by ChIP-chip along the ENCODE regions. In contrast to the global increases of histone acetylation, many genomic regions close to transcription start sites were deacetylated after butyrate exposure. In order to validate these findings, we found that both butyrate and trichostatin A treatment resulted in histone deacetylation at selected regions, while nucleosome loss or changes in histone H3 lysine 4 trimethylation (H3K4me3) did not occur in such locations. Furthermore, similar histone deacetylation events were observed when colon adenocarcinoma HT-29 cells were treated with butyrate. In addition, genes with deacetylated promoters were down-regulated by butyrate, and this was mediated at the transcriptional level by affecting RNA polymerase II (POLR2A) initiation/elongation. Finally, the global increase in acetylated histones was preferentially localized to the nuclear periphery, indicating that it might not be associated to euchromatin. Our results are significant for the evaluation of HDACi as anti-tumourogenic drugs, suggesting that previous models of action might need to be revised, and provides an explanation for the frequently observed repression of many genes during HDACi treatment. [Abstract/Link to Full Text]

Koch CM, Andrews RM, Flicek P, Dillon SC, Kara�z U, Clelland GK, Wilcox S, Beare DM, Fowler JC, Couttet P, James KD, Lefebvre GC, Bruce AW, Dovey OM, Ellis PD, Dhami P, Langford CF, Weng Z, Birney E, Carter NP, Vetrie D, Dunham I
The landscape of histone modifications across 1% of the human genome in five human cell lines.
Genome Res. 2007 Jun;17(6):691-707.
We generated high-resolution maps of histone H3 lysine 9/14 acetylation (H3ac), histone H4 lysine 5/8/12/16 acetylation (H4ac), and histone H3 at lysine 4 mono-, di-, and trimethylation (H3K4me1, H3K4me2, H3K4me3, respectively) across the ENCODE regions. Studying each modification in five human cell lines including the ENCODE Consortium common cell lines GM06990 (lymphoblastoid) and HeLa-S3, as well as K562, HFL-1, and MOLT4, we identified clear patterns of histone modification profiles with respect to genomic features. H3K4me3, H3K4me2, and H3ac modifications are tightly associated with the transcriptional start sites (TSSs) of genes, while H3K4me1 and H4ac have more widespread distributions. TSSs reveal characteristic patterns of both types of modification present and the position relative to TSSs. These patterns differ between active and inactive genes and in particular the state of H3K4me3 and H3ac modifications is highly predictive of gene activity. Away from TSSs, modification sites are enriched in H3K4me1 and relatively depleted in H3K4me3 and H3ac. Comparison between cell lines identified differences in the histone modification profiles associated with transcriptional differences between the cell lines. These results provide an overview of the functional relationship among histone modifications and gene expression in human cells. [Abstract/Link to Full Text]

Gingeras TR
Origin of phenotypes: genes and transcripts.
Genome Res. 2007 Jun;17(6):682-90.
While the concept of a gene has been helpful in defining the relationship of a portion of a genome to a phenotype, this traditional term may not be as useful as it once was. Currently, "gene" has come to refer principally to a genomic region producing a polyadenylated mRNA that encodes a protein. However, the recent emergence of a large collection of unannotated transcripts with apparently little protein coding capacity, collectively called transcripts of unknown function (TUFs), has begun to blur the physical boundaries and genomic organization of genic regions with noncoding transcripts often overlapping protein-coding genes on the same (sense) and opposite strand (antisense). Moreover, they are often located in intergenic regions, making the genic portions of the human genome an interleaved network of both annotated polyadenylated and nonpolyadenylated transcripts, including splice variants with novel 5' ends extending hundreds of kilobases. This complex transcriptional organization and other recently observed features of genomes argue for the reconsideration of the term "gene" and suggests that transcripts may be used to define the operational unit of a genome. [Abstract/Link to Full Text]

Gerstein MB, Bruce C, Rozowsky JS, Zheng D, Du J, Korbel JO, Emanuelsson O, Zhang ZD, Weissman S, Snyder M
What is a gene, post-ENCODE? History and updated definition.
Genome Res. 2007 Jun;17(6):669-81.
While sequencing of the human genome surprised us with how many protein-coding genes there are, it did not fundamentally change our perspective on what a gene is. In contrast, the complex patterns of dispersed regulation and pervasive transcription uncovered by the ENCODE project, together with non-genic conservation and the abundance of noncoding RNA genes, have challenged the notion of the gene. To illustrate this, we review the evolution of operational definitions of a gene over the past century--from the abstract elements of heredity of Mendel and Morgan to the present-day ORFs enumerated in the sequence databanks. We then summarize the current ENCODE findings and provide a computational metaphor for the complexity. Finally, we propose a tentative update to the definition of a gene: A gene is a union of genomic sequences encoding a coherent set of potentially overlapping functional products. Our definition side-steps the complexities of regulation and transcription by removing the former altogether from the definition and arguing that final, functional gene products (rather than intermediate transcripts) should be used to group together entities associated with a single gene. It also manifests how integral the concept of biological function is in defining genes. [Abstract/Link to Full Text]

Weinstock GM
ENCODE: more genomic empowerment.
Genome Res. 2007 Jun;17(6):667-8. [Abstract/Link to Full Text]

Kim JH, Waterman MS, Li LM
Diploid genome reconstruction of Ciona intestinalis and comparative analysis with Ciona savignyi.
Genome Res. 2007 Jul;17(7):1101-10.
One of the main goals in genome sequencing projects is to determine a haploid consensus sequence even when clone libraries are constructed from homologous chromosomes. However, it has been noticed that haplotypes can be inferred from genome assemblies by investigating phase conservation in sequenced reads. In this study, we seek to infer haplotypes, a diploid consensus sequence, from the genome assembly of an organism, Ciona intestinalis. The Ciona intestinalis genome is an ideal resource from which haplotypes can be inferred because of the high polymorphism rate (1.2%). The haplotype estimation scheme consists of polymorphism detection and phase estimation. The core step of our method is a Gibbs sampling procedure. The mate-pair information from two-end sequenced clone inserts is exploited to provide long-range continuity. We estimate the polymorphism rate of Ciona intestinalis to be 1.2% and 1.5%, according to two different polymorphism counting schemes. The distribution of heterozygosity number is well fit by a compound Poisson distribution. The N50 length of haplotype segments is 37.9 kb in our assembly, while the N50 scaffold length of the Ciona intestinalis assembly is 190 kb. We also infer diploid gene sequences from haplotype segments. According to our reconstruction, 85.4% of predicted gene sequences are continuously covered by single haplotype segments. Our results indicate 97% accuracy in haplotype estimation, based on a simulated data set. We conduct a comparative analysis with Ciona savignyi, and discover interesting patterns of conserved DNA elements in chordates. [Abstract/Link to Full Text]

Faux NG, Huttley GA, Mahmood K, Webb GI, de la Banda MG, Whisstock JC
RCPdb: An evolutionary classification and codon usage database for repeat-containing proteins.
Genome Res. 2007 Jul;17(7):1118-27.
Over 3% of human proteins contain single amino acid repeats (repeat-containing proteins, RCPs). Many repeats (homopeptides) localize to important proteins involved in transcription, and the expansion of certain repeats, in particular poly-Q and poly-A tracts, can also lead to the development of neurological diseases. Previous studies have suggested that the homopeptide makeup is a result of the presence of G+C-rich tracts in the encoding genes and that expansion occurs via replication slippage. Here, we have performed a large-scale genomic analysis of the variation of the genes encoding RCPs in 13 species and present these data in an online database (http://repeats.med.monash.edu.au/genetic_analysis/). This resource allows rapid comparison and analysis of RCPs, homopeptides, and their underlying genetic tracts across the eukaryotic species considered. We report three major findings. First, there is a bias for a small subset of codons being reiterated within homopeptides, and there is no G+C or A+T bias relative to the organism's transcriptome. Second, single base pair transversions from the homocodon are unusually common and may represent a mechanism of reducing the rate of homopeptide mutations. Third, homopeptides that are conserved across different species lie within regions that are under stronger purifying selection in contrast to nonconserved homopeptides. [Abstract/Link to Full Text]

Garg K, Green P
Differing patterns of selection in alternative and constitutive splice sites.
Genome Res. 2007 Jul;17(7):1015-22.
In addition to allowing identification of putative functional elements as regions having reduced substitution rates, comparison of genome sequences can also provide insights into these elements at the nucleotide level, by indicating the pattern of tolerated substitutions. We created data sets of orthologous alternative and constitutive splice sites in mouse, rat, and human and analyzed the substitutions occurring within them. Our results illuminate differences between alternative and constitutive sites and, in particular, strongly support the idea that alternative sites are under selection to be weak. [Abstract/Link to Full Text]

Sebaihia M, Peck MW, Minton NP, Thomson NR, Holden MT, Mitchell WJ, Carter AT, Bentley SD, Mason DR, Crossman L, Paul CJ, Ivens A, Wells-Bennik MH, Davis IJ, Cerde�o-T�rraga AM, Churcher C, Quail MA, Chillingworth T, Feltwell T, Fraser A, Goodhead I, Hance Z, Jagels K, Larke N, Maddison M, Moule S, Mungall K, Norbertczak H, Rabbinowitsch E, Sanders M, Simmonds M, White B, Whithead S, Parkhill J
Genome sequence of a proteolytic (Group I) Clostridium botulinum strain Hall A and comparative analysis of the clostridial genomes.
Genome Res. 2007 Jul;17(7):1082-92.
Clostridium botulinum is a heterogeneous Gram-positive species that comprises four genetically and physiologically distinct groups of bacteria that share the ability to produce botulinum neurotoxin, the most poisonous toxin known to man, and the causative agent of botulism, a severe disease of humans and animals. We report here the complete genome sequence of a representative of Group I (proteolytic) C. botulinum (strain Hall A, ATCC 3502). The genome consists of a chromosome (3,886,916 bp) and a plasmid (16,344 bp), which carry 3650 and 19 predicted genes, respectively. Consistent with the proteolytic phenotype of this strain, the genome harbors a large number of genes encoding secreted proteases and enzymes involved in uptake and metabolism of amino acids. The genome also reveals a hitherto unknown ability of C. botulinum to degrade chitin. There is a significant lack of recently acquired DNA, indicating a stable genomic content, in strong contrast to the fluid genome of Clostridium difficile, which can form longer-term relationships with its host. Overall, the genome indicates that C. botulinum is adapted to a saprophytic lifestyle both in soil and aquatic environments. This pathogen relies on its toxin to rapidly kill a wide range of prey species, and to gain access to nutrient sources, it releases a large number of extracellular enzymes to soften and destroy rotting or decayed tissues. [Abstract/Link to Full Text]

Belov K, Sanderson CE, Deakin JE, Wong ES, Assange D, McColl KA, Gout A, de Bono B, Barrow AD, Speed TP, Trowsdale J, Papenfuss AT
Characterization of the opossum immune genome provides insights into the evolution of the mammalian immune system.
Genome Res. 2007 Jul;17(7):982-91.
The availability of the first marsupial genome sequence has allowed us to characterize the immunome of the gray short-tailed opossum (Monodelphis domestica). Here we report the identification of key immune genes, including the highly divergent chemokines, defensins, cathelicidins, and Natural Killer cell receptors. It appears that the increase in complexity of the mammalian immune system occurred prior to the divergence of the marsupial and eutherian lineages approximately 180 million years ago. Genomes of ancestral mammals most likely contained all of the key mammalian immune gene families, with evolution on different continents, in the presence of different pathogens leading to lineage specific expansions and contractions, resulting in some minor differences in gene number and composition between different mammalian lineages. Gene expansion and extensive heterogeneity in opossum antimicrobial peptide genes may have evolved as a consequence of the newborn young needing to survive without an adaptive immune system in a pathogen laden environment. Given the similarities in the genomic architecture of the marsupial and eutherian immune systems, we propose that marsupials are ideal model organisms for the study of developmental immunology. [Abstract/Link to Full Text]

Carmel L, Rogozin IB, Wolf YI, Koonin EV
Evolutionarily conserved genes preferentially accumulate introns.
Genome Res. 2007 Jul;17(7):1045-50.
Introns that interrupt eukaryotic protein-coding sequences are generally thought to be nonfunctional. However, for reasons still poorly understood, positions of many introns are highly conserved in evolution. Previous reconstructions of intron gain and loss events during eukaryotic evolution used a variety of simplified evolutionary models that yielded contradicting conclusions and are not suited to reveal some of the key underlying processes. We combine a comprehensive probabilistic model and an extended data set, including 391 conserved genes from 19 eukaryotes, to uncover previously unnoticed aspects of intron evolution--in particular, to assign intron gain and loss rates to individual genes. The rates of intron gain and loss in a gene show moderate positive correlation. A gene's intron gain rate shows a highly significant negative correlation with the coding-sequence evolution rate; intron loss rate also significantly, but positively, correlates with the sequence evolution rate. Correlations of the opposite signs, albeit less significant ones, are observed between intron gain and loss rates and gene expression level. It is proposed that intron evolution includes a neutral component, which is manifest in the positive correlation between the gain and loss rates and a selection-driven component as reflected in the links between intron gain and loss and sequence evolution. The increased intron gain and decreased intron loss in evolutionarily conserved genes indicate that intron insertion often might be adaptive, whereas some of the intron losses might be deleterious. This apparent functional importance of introns is likely to be due, at least in part, to their multiple effects on gene expression. [Abstract/Link to Full Text]

Carmel L, Wolf YI, Rogozin IB, Koonin EV
Three distinct modes of intron dynamics in the evolution of eukaryotes.
Genome Res. 2007 Jul;17(7):1034-44.
Several contrasting scenarios have been proposed for the origin and evolution of spliceosomal introns, a hallmark of eukaryotic genes. A comprehensive probabilistic model to obtain a definitive reconstruction of intron evolution was developed and applied to 391 sets of conserved genes from 19 eukaryotic species. It is inferred that a relatively high intron density was reached early, i.e., the last common ancestor of eukaryotes contained >2.15 introns/kilobase, and the last common ancestor of multicellular life forms harbored approximately 3.4 introns/kilobase, a greater intron density than in most of the extant fungi and in some animals. The rates of intron gain and intron loss appear to have been dropping during the last approximately 1.3 billion years, with the decline in the gain rate being much steeper. Eukaryotic lineages exhibit three distinct modes of evolution of the intron-exon structure. The primary, balanced mode, apparently, operates in all lineages. In this mode, intron gain and loss are strongly and positively correlated, in contrast to previous reports on inverse correlation between these processes. The second mode involves an elevated rate of intron loss and is prevalent in several lineages, such as fungi and insects. The third mode, characterized by elevated rate of intron gain, is seen only in deep branches of the tree, indicating that bursts of intron invasion occurred at key points in eukaryotic evolution, such as the origin of animals. Intron dynamics could depend on multiple mechanisms, and in the balanced mode, gain and loss of introns might share common mechanistic features. [Abstract/Link to Full Text]

Oliver MJ, Petrov D, Ackerly D, Falkowski P, Schofield OM
The mode and tempo of genome size evolution in eukaryotes.
Genome Res. 2007 May;17(5):594-601.
Eukaryotic genome size varies over five orders of magnitude; however, the distribution is strongly skewed toward small values. Genome size is highly correlated to a number of phenotypic traits, suggesting that the relative lack of large genomes in eukaryotes is due to selective removal. Using phylogenetic contrasts, we show that the rate of genome size evolution is proportional to genome size, with the fastest rates occurring in the largest genomes. This trend is evident across the 20 major eukaryotic clades analyzed, indicating that over long time scales, proportional change is the dominant and universal mode of genome-size evolution in eukaryotes. Our results reveal that the evolution of eukaryotic genome size can be described by a simple proportional model of evolution. This model explains the skewed distribution of eukaryotic genome sizes without invoking strong selection against large genomes. [Abstract/Link to Full Text]

Recent Articles in Journal of Applied Genetics

Bobkowski W, Sobieszcza?ska M, Turska-Kmie? A, Nowak A, Jagielski J, Gonerska M, Lebioda A, Siwi?ska A
Mutation of the MYH7 gene in a child with hypertrophic cardiomyopathy and Wolff-Parkinson-White syndrome.
J Appl Genet. 2007;48(2):185-8.
Familial hypertrophic cardiomyopathy (HCM) displays autosomal dominant inheritance with incomplete penetration of defective genes. Data concerning the familial occurrence of ventricular preexcitation, i.e. Wolff-Parkinson-White (WPW) syndrome, also indicate autosomal dominant inheritance. In the literature, only a gene mutation on chromosome 7q3 has been described in familial HCM coexisting with WPW syndrome to date. The present paper describes the case of a 7-year-old boy with HCM and coexisting WPW syndrome. On his chromosome 14, molecular diagnostics revealed a C 9123 mutation (arginine changed into cysteine in position 453) in exon 14 in a copy of the gene for beta-myosin heavy chain (MYH7). It is the first known case of mutation of the MYH7 gene in a child with both HCM and WPW. Since no linkage between MYH7 mutation and HCM with WPW syndrome has been reported to date, we cannot conclude whether the observed mutation is a common cause for both diseases, or this patient presents an incidental co-occurrence of HCM (caused by MYH7 mutation) and WPW syndrome. [Abstract/Link to Full Text]

Borkowska E, Binka-Kowalska A, Constantinou M, Nawrocka A, Matych J, Ka?uzewski B
P53 mutations in urinary bladder cancer patients from Central Poland.
J Appl Genet. 2007;48(2):177-83.
The present study aimed at detection of P53 gene mutations in cells of urinary bladder neoplasms, as the mutations may be regarded as an independent prognostic factor for progression and recurrence of tumours. In the study, 82 patients with clinically diagnosed urinary bladder tumour were included. The control was composed of DNA samples from urine and blood of 202 healthy patients. Exons 5-8 of the P53 gene were screened for mutations by using multitemperature single-strand conformational polymorphism (MSSCP) analysis. Samples with abnormal MSSCP patterns were subjected to direct sequencing. The frequency of mutations in exons 5-8 of the P53 gene in patients with bladder cancer was lower (3.3% in grade G1, 24% in G2, and 39% in G3) than the data reported in the literature. We found a higher percentage of polymorphism at codon 213 of the P53 gene in bladder cancer patients (6%), compared with the values in the reference group (2.5%). These results were matched with those of the loss of heterozygosity (LOH) analysis. In conclusion, mutations were found mainly in more advanced histopathological and clinical stages of the disease and at the CIS stage (carcinoma in situ). It cannot be excluded that the observed polymorphism at codon 213 may be a predisposing factor for urinary bladder carcinoma development. [Abstract/Link to Full Text]

Pietrzak J, Mrasek K, Obersztyn E, Stankiewicz P, Kosyakova N, Weise A, Cheung SW, Cai WW, von Eggeling F, Mazurczak T, Bocian E, Liehr T
Molecular cytogenetic characterization of eight small supernumerary marker chromosomes originating from chromosomes 2, 4, 8, 18, and 21 in three patients.
J Appl Genet. 2007;48(2):167-75.
Small supernumerary marker chromosomes (sSMCs) are a morphologically heterogeneous group of additional structurally abnormal chromosomes that cannot be identified unambiguously by conventional banding techniques alone. Molecular cytogenetic methods enable detailed characterization of sSMCs; however, in many cases interpretation of their clinical significance is problematic. The aim of our study was to characterize precisely sSMCs identified in three patients with dysmorphic features, psychomotor retardation and multiple congenital anomalies. We also attempted to correlate the patients' genotypes with phenotypes by inclusion of data from the literature. The sSMCs were initially detected by G-banding analysis in peripheral blood lymphocytes in these patients and were subsequently characterized using multicolor fluorescence in situ hybridization (M-FISH), (sub)centromere-specific multicolor FISH (cenM-FISH, subcenM-FISH), and multicolor banding (MCB) techniques. Additionally, the sSMCs in two patients were also studied by hybridization to whole-genome bacterial artificial chromosome (BAC) arrays (array-CGH) to map the breakpoints on a single BAC clone level. In all three patients, the chromosome origin, structure, and euchromatin content of the sSMCs were determined. In patient RS, only a neocentric r(2)(q35q36) was identified. It is a second neocentric sSMC(2) in the literature and the first marker chromosome derived from the terminal part of 2q. In the other two patients, two sSMCs were found, as M-FISH detected additional sSMCs that could not be characterized in G-banding analysis. In patient MK, each of four cell lines contained der(4)(:p11.1-->q12:) accompanied by a sSMC(18): r(18)(:p11.2-->q11.1::p11.2-->q11.1:), inv dup(18)(:p11.1-->q11.1::q11.1-->p11.1:), or der(18) (:p11.2-->q11.1::q11.1-->p11.1:). In patient NP, with clinical features of trisomy 8p, three sSMCs were characterized: r(8)(:p12-->q11.1::q11.1-->p21:) der(8) (:p11.22-->q11.1::q11.1-->p21::p21-->p11.22:) and der(21)(:p11.1-->q21.3:). The BAC array results confirmed the molecular cytogenetic results and refined the breakpoints to the single BAC clone resolution. However, the complex mosaic structure of the marker chromosomes derived from chromosomes 8 and 18 could only be identified by molecular cytogenetic methods. This study confirms the usefulness of multicolor FISH combined with whole-genome arrays for comprehensive analyses of marker chromosomes. [Abstract/Link to Full Text]

Kowalczyk M, Srebniak M, Tomaszewska A
Chromosome abnormalities without phenotypic consequences.
J Appl Genet. 2007;48(2):157-66.
Some changes in chromosome morphology, detected during cytogenetic analysis, are not associated with clinical defects. Therefore a proper discrimination of harmless variants from true abnormalities, especially during prenatal diagnosis, is crucial to allow precise counseling. In this review we described chromosome variants and examples of chromosome anomalies that are considered to be unrelated to phenotypic consequences. The correlation between the presence of marker chromosomes and a risk of clinical signs is also discussed. Structural rearrangements of heterochromatic material, satellite polymorphism, or fragile sites, are well-known examples of common chromosome variation. However, the absence of clinical effects has also been reported in some cases of chromosome abnormalities concerning euchromatin. Such euchromatic anomalies were divided into 2 categories: unbalanced chromosome abnormalities (UBCAs), such as deletions or duplications, and euchromatic variants (EVs). Recently so-called molecular karyotyping, especially whole-genome screening by the use of high-resolution array-CGH technique, contributed to revealing a high number of previously unknown small genomic variations, which seem to be asymptomatic, as they are present in phenotypically normal individuals. [Abstract/Link to Full Text]

Patel RK, Singh KM, Soni KJ, Chauhan JB, Sambasiva Rao KR
Low incidence of bovine leukocyte adhesion deficiency (BLAD) carriers in Indian cattle and buffalo breeds.
J Appl Genet. 2007;48(2):153-5.
BLAD is an autosomal recessive genetic disease that affects Holstein-Friesian (HF) cattle worldwide. It is a disease characterized by a reduced expression of the adhesion molecules on neutrophils. The disease is caused by a mutation that replaces adenine at 383 with guanine, which causes an amino acid change from aspartic acid to glycine. Blood samples and a few semen samples were collected from 1250 phenotypically normal individuals, including HF (N=377), HF crossbred (N=334), Jersey (105), other breeds of cattle (N=160) and water buffalo Bubalus bubalis (N=274) belonging to various artificial insemination stations, bull mother farms (BMFs) and embryo transfer (ET) centres across the country. PCR-RFLP was performed to detect a point mutation in CD18, surface molecules of neutrophils. The results indicate that out of 1250 cattle and buffaloes tested for BLAD, 13 HF purebreds out of 377 and 10 HF crossbreds out of 334 appear to be BLAD carriers. In the HF and HF crossbred population, the percentage of BLAD carriers was estimated as 3.23%. The condition is alarming as the mutant gene has already entered the HF crossbred cattle population and therefore, the population of HF and its crossbreds needs regular screening to avoid the risk of spreading BLAD in the breeding cattle population of India. [Abstract/Link to Full Text]

Corr�a MJ, da Mota MD
Genetic evaluation of performance traits in Brazilian Quarter Horse.
J Appl Genet. 2007;48(2):145-51.
The aim of this study was to estimate genetic parameters for racing performance traits in Quarter Horses in Brazil. The data (provided by the Sorocaba Jockey Club) came from 3 Brazilian hippodromes in 1994-2003, with 11 875 observations of race time and 7775 of the speed index (SI), distributed in 2403 and 2169 races, respectively. The variance components were estimated by the MTGSAM program, under animal models including the random additive genetic effect, random permanent environmental effect, and the fixed effects of sex, age and race. Heritabilities for race time and the SI, for the 3 distances studied (301, 365 and 402 m), varied from 0.26 to 0.41 and from 0.14 to 0.19, respectively, whereas repeatabilities varied from 0.36 to 0.68 (time) and from 0.27 to 0.42 (SI) and the genetic correlations from 0.90 to 0.97 (time) and from 0.67 to 0.73 (SI). [Abstract/Link to Full Text]

Aksu S, Koczan D, Renne U, Thiesen HJ, Brockmann GA
Differentially expressed genes in adipose tissues of high body weight-selected (obese) and unselected (lean) mouse lines.
J Appl Genet. 2007;48(2):133-43.
Recently, quantitative trait loci (QTLs) for body weight and obesity have been mapped in an intercross population between the high body weight-selected mouse line DU6i and the inbred line DBA/2. Most QTLs were highly significant, but had small effects only. Under the hypothesis that small-effect QTLs might result from changes in gene activity, our strategy to identify candidate genes for the observed effects was directed towards the identification of differentially expressed genes. Therefore, here we compare the transcription profile of about 11 000 genes in epididymal fat tissues of males of two high body weight-selected (DU6 and DU6i) and two unselected mouse lines (DUKs and DBA/2). For the hybridisation of GeneChips, we used pooled samples of 20 individual mice. By pair-wise comparisons between selected and unselected mouse lines, a set of 77 genes was identified representing genes whose level of expression differed between obese and lean mouse strains. According to the functional classification of genes, 69 differentially expressed genes were involved in regulatory and metabolic pathways, cell division, cell stability, or immune response, and thus might have an effect on body weight and fat accumulation. 14 out of these genes, occur in QTL regions for body weight or abdominal fat weight. Further analyses are necessary to discriminate between genes directly causing QTL effects and indirectly regulated differentially expressed genes. [Abstract/Link to Full Text]

Gebler P, Wolko ?, Knaflewski M
Identification of molecular markers for selection of supermale (YY) asparagus plants.
J Appl Genet. 2007;48(2):129-31.
The research was aimed to elaborate a method for selection of male plants (XY, YY) and female ones (XX) as well as for identification of supermale genotypes (YY) among male phenotypes. The population obtained by self-pollination of andromonoecious plants was analysed. In order to identify the bands differentiating the male from the female genotypes, Bulk Segregant Analysis (BSA) was carried out. Primers identified by BSA analysis were used for RAPD amplification on the template of the male and female individuals. Among the products obtained by the use of primer OPB-20, some bands were linked with sex. A band of about 700 bp was found in all female plants, and in 4 phenotypically male specimens. In the male plants, the band showed a much lower intensity, compared with the female specimens. It seems that this fragment can be linked to the X chromosome in the investigated specimens. In the female specimens with XX karyotype, template duplication occurs and hence the band intensity is twice as high as in the XY karyotype. Three male plants did not include the OPB-20-700 fragment so they could potentially have the supermale (YY) karyotype. If the obtained marker proved its usefulness for identification of supermale plants, it could become a valuable tool facilitating breeding work. [Abstract/Link to Full Text]

Ariyarathna C, Gunasekare K
Genetic base of tea (Camellia sinensis L.) cultivars in Sri Lanka as revealed by pedigree analysis.
J Appl Genet. 2007;48(2):125-8.
An understanding of genetic diversity and relationships among breeding materials is a prerequisite for crop improvement. Coefficient of parentage (COP) can be used to measure the genetic diversity among genotypes on the basis of pedigree information. In the present study, COP was estimated for 56 cultivars, including commercial tea cultivars developed by the Tea Research Institute of Sri Lanka and their parental lines. Mean COP of the 56 accessions studied was 0.097 and the value was raised up to 0.272 when non-related pair-wise comparisons were excluded. A single cultivar (Assam/Cambod introduction) was the nucleus of the commercial cultivars. Group mean COP of the cultivars derived from Assam/Cambod parentage was 0.17. Thirty-three percent of the pair-wise comparisons had 0.00 COP, highlighting that many cultivars were unrelated. Within the pedigree, 2 major COP clusters were identified: Assam/Cambod open-pollinated half-sib progenies, and full-sib progenies derived from crosses between Assam/Cambod and other parental lines. The elite groups within the pedigree, where Assam/Cambod parentage was concentrated, were also identified. Information generated in this study should be useful for effective utilization of available diversity in future breeding programmes as well as for proper conservation of genetic diversity in the adapted germplasm. This is the first report on estimates of genetic diversity based on COP in a woody perennial crop, such as tea. [Abstract/Link to Full Text]

Escand�n AS, Zelener N, de la Torre MP, Soto S
Molecular identification of new varieties of Nierembergia linariaefolia (Graham), a native Argentinean ornamental plant.
J Appl Genet. 2007;48(2):115-23.
Six Nierembergia linariaefolia clones were selected for ornamental traits during a native germplasm development program. For fingerprinting diagnosis, 13 anchored inter-simple sequence repeat (ISSR) primers and 6 amplified fragment length polymorphism (AFLP) primer-enzyme combinations were used. Both markers revealed high levels of polymorphism, enabling genetic discrimination of the accessions analyzed by using 443 informative ISSRs and 541 AFLP markers. Both molecular techniques are suitable for monitoring genetic diversity in Nierembergia linariaefolia and, under our experimental conditions, they showed correlation coefficients of 0.629 for similarity matrices and of 0.649 in the cophenetic matrices. These results suggest that ISSRs are a good choice for DNA analysis in N. linariaefolia when simple manipulation and a low budget are required. [Abstract/Link to Full Text]

Branco CJ, Vieira EA, Malone G, Kopp MM, Malone E, Bernardes A, Mistura CC, Carvalho FI, Oliveira CA
IRAP and REMAP assessments of genetic similarity in rice.
J Appl Genet. 2007;48(2):107-13.
Rice is a model genome for cereal research, providing important information about genome structure and evolution. Retrotransposons are common components of grass genomes, showing activity at transcription, translation and integration levels. Their abundance and ability to transpose make them good potential markers. In this study, we used 2 multilocus PCR-based techniques that detect retrotransposon integration events in the genome: IRAP (inter-retrotransposon amplified polymorphism) and REMAP (retrotransposon-microsatellite amplified polymorphism). Markers derived from Tos17, a copia-like endogenous retrotransposon of rice, were used to identify genetic similarity among 51 rice cultivars (Oryza sativa L.). Genetic similarity analysis was performed by means of the Dice coefficient, and dendrograms were developed by using the average linkage distance method. A cophenetic correlation coefficient was also calculated. The clustering techniques revealed a good adjustment between matrices, with correlation coefficients of 0.74 and 0.80, or lower (0.21) but still significant, between IRAP and REMAP-based techniques. Consistent clusters were found for Japanese genotypes, while a subgroup clustered the irrigated Brazilian genotypes. [Abstract/Link to Full Text]

Juchimiuk J, Hering B, Maluszynska J
Multicolour FISH in an analysis of chromosome aberrations induced by N-nitroso-N-methylurea and maleic hydrazide in barley cells.
J Appl Genet. 2007;48(2):99-106.
The present study is a rare example of a detailed characterization of chromosomal aberrations by identification of individual chromosomes (or chromosome arms) involved in their formation in plant cells by using fluorescent in situ hybridization (FISH). In addition, the first application of more than 2 DNA probes in FISH experiments in order to analyse chromosomal aberrations in plant cells is presented. Simultaneous FISH with 5S and 25S rDNA and, after reprobing of preparations, telomeric and centromeric DNA sequences as probes, were used to compare the cytogenetic effects of 2 chemical mutagens: N-nitroso-N-methylurea (MNU) and maleic hydrazide (MH) on root tip meristem cells of Hordeum vulgare (2n=14). The micronucleus (MN) test combined with FISH allowed the quantitative analysis of the involvement of specific chromosome fragments in micronuclei formation and thus enabled the possible origin of mutagen-induced micronuclei to be explained. Terminal deletions were most frequently caused by MH and MNU. The analysis of the frequency of micronuclei with signals of the investigated DNA probes showed differences between the frequency of MH- and MNU-induced micronuclei with specific signals. The micronuclei with 2 signals, telomeric DNA and rDNA (5S and/or 25S rDNA), were the most frequently observed in the case of both mutagens, but with a higher frequency after treatment with MH (46%) than MNU (37%). Also, 10% of MH-induced micronuclei were characterized by the presence of only telomere DNA sequences, whereas there were almost 3-fold more in the case of MNU-induced micronuclei (28%). Additionally, by using FISH with the same probes, an attempt was made to identify the origin of chromosome fragments in mitotic anaphase. [Abstract/Link to Full Text]

Rivera H, V�squez-Vel�squez AI, Ramirez-Duenas Mde L, Becerra-Solano LE
A 9p13-->p24 duplication coupled with a whole 22q translocation onto 9p24.
J Appl Genet. 2007;48(1):95-8.
We report on a 3-year-old girl with a typical 9p trisomy syndrome, whose 45-chromosome karyotype includes a 9p+. As assessed by G, C and Ag-NOR bands, the rearranged chromosome resulted from a 9p13-->p24 direct duplication coupled with a translocation of the whole 22q onto 9pter, had heterochromatin at the junction site, lacked both nucleolar organizing regions (NORs) and centromere dots at the unconstricted fusion point, and was present in all metaphases scored. FISH results: a 9p subtelomere probe gave a diminished signal on the 9p+ precisely at the duplication junction 9p24::9p13, but no labeling was observed at the 9;22 translocation site; a pancentromeric alphoid probe labeled all centromeres, and gave a distinct signal at the 9pter;22cen junction. Hence, her karyotype was 45,XX,rea(9;22)(9qter-->9p24::9p13-->9p24::22p10-->22qter).ish rea(9;22) (9psubtel+dim,pancen+). Parental chromosomes were normal. The distinctiveness of the present centromere-telomere fusion rests on the coupling of an intrachromosomal distal duplication with a whole-arm translocation including alphoid DNA onto the duplicated segment. The centromeric inertia of the residual alphoid DNA in the present case compares with the variable functional status of the chromosome 22 centromere in true heterodicentrics involving such a chromosome. [Abstract/Link to Full Text]

Salahshourifar I, Gilani MA, Vosough A, Tavakolzadeh T, Tahsili M, Mansori Z, Karimi H, Totonchi M, Gourabi H
De novo complex chromosomal rearrangement of 46, XY, t (3; 16; 8) (p26; q13; q21.2) in a non-obstructive azoospermic male.
J Appl Genet. 2007;48(1):93-4.
Complex Chromosomal Rearrangements (CCRs) are rare structural abnormalities that are usually associated with infertility or subfertility in male carriers. We described clinical and chromosomal features of a non-obstructive azoospermic male that has been referred for infertility. Cytogenetic analysis showed three chromosomes, i.e. 3, 8 and 16, which have been involved and caused spermatogenesis failure. [Abstract/Link to Full Text]

Mork?nien� A, Steponavici?t D, Utkus A, Kucinskas V
Few associations of candidate genes with nonsyndromic orofacial clefts in the population of Lithuania.
J Appl Genet. 2007;48(1):89-91.
Nonsyndromic orofacial clefting (NS-OFC) is a common complex multifactorial trait with a considerable genetic component and a number of candidate genes suggested by various approaches. Twenty biallelic and microsatellite DNA markers in the strong candidate loci TGFA, TGFB3, GABRB3, RARA, and BCL3 were analysed for allelic association with the NS-OFC phenotype in 112 nuclear families (proband + both parents) from Lithuania by using the transmission disequilibrium test (TDT). Associations were found between the TGFA gene marker rs2166975 and nonsyndromic cleft palate only (CPO) phenotype (p = 0.045, df 1) as well as between the D2S292 marker and the cleft lip with or without cleft palate (CL/CP) phenotype in allele-wise TDT (P = 0.005, df 9) and genotype-wise TDT (P = 0.021, df 24). A weak association (P = 0.085, df 3) of the BCL3 marker (BCL3 gene) with the risk of CPO was also found. Thus our initial results support the contribution of allelic variation in the TGFA locus to the aetiology of CL/CP in the population of Lithuania but they do not point to TGFA as a major causal gene. Different roles of the TGFA and BCL3 genes in the susceptibility to NS-OFC phenotypes are suggested. [Abstract/Link to Full Text]

Skrzypczak U, Rutkiewicz E, Pogorzelski A, Witt M, Zietkiewicz E
Carrier status for 3 most frequent CFTR mutations in Polish PCD/KS patients: lack of association with the primary ciliary dyskinesia phenotype.
J Appl Genet. 2007;48(1):85-8.
We screened a large group of primary ciliary dyskinesia/Kartagener syndrome (PCD/KS) patients and their siblings (148 patients from 126 unrelated families) for the presence of the CFTR mutations that are most frequently found in the Polish population: the severe F508del and 2,3del21kb, and the mild 3849+10kbC > T. No statistically significant increase in the frequency of these mutations was found in the studied group, as compared with the general population. This is consistent with an earlier observation in another population and indicates that the status of being a carrier of any of these CFTR mutations should not be considered as an important risk factor in PCD/KS pathogenesis. [Abstract/Link to Full Text]

J�?kowska J, Derwich K, Dawidowska M
Methods of minimal residual disease (MRD) detection in childhood haematological malignancies.
J Appl Genet. 2007;48(1):77-83.
The appropriate management of haematological disorders must rely on a precise and long-term monitoring of the patient's response to chemotherapy and radiotherapy. Clinical data are not sufficient and that is why in the last decade it became the most important to improve the knowledge of haematological diseases on the basis of molecular techniques and molecular markers. The presence of residual malignant cells among normal cells is termed minimal residual disease (MRD). Nowadays a great progress has been made in the treatment of malignant diseases and in the development of reliable molecular techniques, which are characterised by high sensitivity (10-3- 10-6) and ability to distinguish between normal and malignant cells at diagnosis and during follow-up. Especially, MRD data based on quantitative analysis (RQ-PCR, RT-RQ-PCR) appear to be crucial for appropriate evaluation of treatment response in many haematological malignancies. Implementation of standardized approaches for MRD assessment into routine molecular diagnostics available in all oncohaematological centres should be regarded nowadays a crucial point in further MRD study development. [Abstract/Link to Full Text]

Szczerbal I, Lin L, Stachowiak M, Chmurzynska A, Mackowski M, Winter A, Flisikowski K, Fries R, Switonski M
Cytogenetic mapping of DGAT1, PPARA, ADIPOR1 and CREB genes in the pig.
J Appl Genet. 2007;48(1):73-6.
In the present study we show FISH localization of 4 porcine BAC clones harbouring potential candidate genes for fatness traits: DGAT1 (SSC4p15), PPARA (SSC5p15), ADIPOR1 (SSC10p13) and CREB (SSC15q24). Until now the CREB and ADIPOR1 genes are considered to be monomorphic, DGAT1 is highly polymorphic, while for the PPARA gene only 1 SNP was identified. Assignment of the studied genes in relation to QTL chromosome regions for meat quality in pig chromosomes SSC4, SSC5, SSC10 and SSC15 is discussed. [Abstract/Link to Full Text]

Czarnik U, Zabolewicz T, Strychalski J, Grzybowski G, Bogusz M, Walawski K
Deletion/insertion polymorphism of the prion protein gene (PRNP) in Polish Holstein-Friesian cattle.
J Appl Genet. 2007;48(1):69-71.
The aim of the present study was to identify the deletion/insertion polymorphism of the bovine prion protein gene (PRNP) within the promoter sequence (23 bp), intron 1 (12 bp) and 3' untranslated region (14 bp). DNA was isolated from blood of 234 randomly tested Polish Holstein-Friesian cows and from semen of 47 sires used for artificial insemination (AI) in 2004. No statistically significant differences were found in the frequency of genotypes and alleles between cows and breeding bulls in the 3 analysed polymorphic sites within the PRNP gene. Only 3 haplotypes were identified in sires and 4 haplotypes in cows. [Abstract/Link to Full Text]

Chung H, Choi B, Jang G, Lee K, Kim H, Yoon S, Im S, Davis M, Hines H
Effect of variants in the ovine skeletal-muscle-specific calpain gene on body weight.
J Appl Genet. 2007;48(1):61-8.
The ovine skeletal-muscle-specific calpain gene (p94), which is known also as the n-calpain or calpain 3 gene (CAPN3), was screened with primers. Selection of the PCR primers was based on the ovine cDNA sequence (GenBank accession No. AF087570). After sequence alignment between the ovine and human (AY902237) genes, exon and intron boundaries were determined. Polymorphisms were observed in the intron region for the CAPN31112 and CAPN31213 segments, and the sequences for these segments were submitted to the GenBank (AF309635 and AY102617, respectively). Body weight was recorded at birth, weaning and post-weaning. Calpain 3 genotypes of the CAPN31112 segment were associated with birth weight (P < 0.01), and a dominant gene effect was observed. Breeding group, birth type, and rearing type were significantly associated with weight traits. Allele frequencies were similar in purebred and crossbred animals. [Abstract/Link to Full Text]

Melo EO, Canavessi AM, Franco MM, Rumpf R
Animal transgenesis: state of the art and applications.
J Appl Genet. 2007;48(1):47-61.
There is a constant expectation for fast improvement of livestock production and human health care products. The advent of DNA recombinant technology and the possibility of gene transfer between organisms of distinct species, or even distinct phylogenic kingdoms, has opened a wide range of possibilities. Nowadays we can produce human insulin in bacteria or human coagulation factors in cattle milk. The recent advances in gene transfer, animal cloning, and assisted reproductive techniques have partly fulfilled the expectation in the field of livestock transgenesis. This paper reviews the recent advances and applications of transgenesis in livestock and their derivative products. At first, the state of art and the techniques that enhance the efficiency of livestock transgenesis are presented. The consequent reduction in the cost and time necessary to reach a final product has enabled the multiplication of transgenic prototypes around the world. We also analyze here some emerging applications of livestock transgenesis in the field of pharmacology, meat and dairy industry, xenotransplantation, and human disease modeling. Finally, some bioethical and commercial concerns raised by the transgenesis applications are discussed. [Abstract/Link to Full Text]

Feng J, Zhang Z, Li G, Zhou Y, Wang H, Guo Q, Sun J
Inheritance of resistance to stripe rust in winter wheat cultivars Aquileja and Xian Nong 4.
J Appl Genet. 2007;48(1):43-6.
Winter wheat cultivars Aquileja (AQ) and Xian Nong 4(XN) were previously reported to possess durable, quantitative resistance to stripe rust disease. In the present study, AQ, XN and a susceptible wheat cultivar were reciprocally crossed in all 6 combinations. Parents, F1, F2, F3, BCP1 and BCP2 were used to determine quantitative genetic parameters for infection type and disease severity. The results showed that fixable genetic components preponderated in the inheritance of the resistance in AQ and XN for both infection type and disease severity, while the dominant component could be detected in some cases. The resistance was conditioned by oligogenes. Heritability of the resistance ranged from 50 to 79% in most cases. [Abstract/Link to Full Text]

Wang HY, Wei YM, Yan ZH, Zheng YL
EST-SSR DNA polymorphism in durum wheat (Triticum durum L.) collections.
J Appl Genet. 2007;48(1):35-42.
SSRs derived from EST were molecular markers belonging to the transcribed region of the genome. Therefore, any polymorphism detected using EST-SSRs might reflect the better relationship among species or varieties. Using wheat EST-SSR markers, 60 durum wheat (Triticum durum L.) accessions from seven countries were investigated. Twenty-five primer pairs could amplify successfully in the 60 durum wheat accessions, of which tri-nucleotide repeats were the dominant type, and revealed 26 loci on all seven wheat homologous chromosome groups. A total of 87 eSSR alleles were detected, and the number of alleles detected by a single pair of primers ranged from 1 to 11, with an average of 3.3 alleles per locus. Higher numbers of alleles and PIC were identified on the B genome than those on the A genome. [Abstract/Link to Full Text]

Akond MA, Watanabe N, Furuta Y
Exploration of genetic diversity among Xinjiang Triticum and Triticum polonicum by AFLP markers.
J Appl Genet. 2007;48(1):25-33.
Seventy-two Xinjiang Triticum and Triticum polonicum accessions were subjected to AFLP analyses to discuss the origin of Triticum petropavlovskyi. A total of 91 putative loci were produced by four primer combinations. Among them 56 loci were polymorphic, which is equivalent to 61.53 % of the total number of putative loci. Genetic diversity among 11 T. petropavlovskyi accessions was narrow due to the lowest number (32) of polymorphic loci among the wheat species. Forty four polymorphic loci were found in T. aestivum and T. compactum, whereas the highest polymorphism was observed in T. polonicum. On the basis of the UPGMA clustering and PCO grouping and genetic similarity estimates from the AFLPs, we noted that T. petropavlovskyi was more closely related to the Chinese accessions of T. polonicum than to T. polonicum from other countries. Two accessions of T. aestivum were grouped with T. petropavlovskyi in the UPGMA clustering. Both of them were similar to T. petropavlovskyi in respect of spike structure, i.e. the presence of awn, glume awn and also the presence of leaf pubescence. Six loci, which were commonly absent in Chinese T. polonicum, were also absent in almost all of the T. petropavlovskyi accessions. Findings of this study reduced the probability of an independent allopolyploidization event in the origin of T. petropavlovskyi and indicated a greater degree of gene flow between T. aestivum and T. polonicum leading to T. petropavlovskyi. It is most likely that the P-gene of T. petropavlovskyi hexaploid wheat was introduced from T. polonicum to T. aestivum via a spontaneous introgression or breeding effort. [Abstract/Link to Full Text]

Milczarski P, Banek-Tabor A, Lebiecka K, Stoja?owski S, My?k�w B, Masoj? P
New genetic map of rye composed of PCR-based molecular markers and its alignment with the reference map of the DS2 x RXL10 intercross.
J Appl Genet. 2007;48(1):11-24.
A new genetic map of rye, developed by using the 541 x Ot1-3 F2 intercross, consists of 148 marker loci, including 99 RAPDs, 18 SSRs, 14 STSs, 9 SCARs and 7 ISSRs, and spans the distance of 1401.4 cM. To the 7 rye chromosomes, 8 linkage groups were assigned and compared with the reference map of the DS2 x RXL10 F2 intercross by using 24 common markers. The 2 combined maps contain altogether 611 marker loci (70-109 per chromosome) and constitute a substantial source of information useful for further genomic studies in rye. From 21 to 37 RAPD marker loci are distributed randomly along each chromosome length and their total number for all 7 rye chromosomes is 177. This abundance of RAPD marker loci in the rye genetic map can be exploited for development of SCARs in regions containing important genes or QTL. [Abstract/Link to Full Text]

Bartoszewski G, Havey MJ, Zi�?kowska A, D?ugosz M, Malepszy S
The selection of mosaic (MSC) phenotype after passage of cucumber (Cucumis sativus L.) through cell culture - a method to obtain plant mitochondrial mutants.
J Appl Genet. 2007;48(1):1-9.
Mosaic (MSC) mutants of cucumber (Cucumis sativus L.) appear after passage through cell cultures. The MSC phenotype shows paternal transmission and is associated with mitochondrial DNA rearrangements. This review describes the origins and phenotypes of independently produced MSC mutants of cucumber, including current knowledge on their mitochondrial DNA rearrangements, and similarities of MSC with other plant mitochondrial mutants. Finally we propose that passage of cucumber through cell culture can be used as a unique and efficient method to generate mitochondrial mutants of a higher plant in a highly homozygous nuclear background. [Abstract/Link to Full Text]

Ramegowda S, Gawde HM, Hyderi A, Savitha MR, Patel ZM, Krishnamurthy B, Ramachandra NB
De novo isochromosome 18p in a female dysmorphic child.
J Appl Genet. 2006;47(4):397-401.
Isochromosome 18p results in tetrasomy 18p. Most of the i(18p) cases reported so far in the literature are sporadic due to de novo formation, while familial and mosaic cases are infrequent. It is a rare chromosomal abnormality, occurring once in every 140,000 livebirths, affecting males and females equally. In the present investigation, we report a de novo i(18p) in a female dysmorphic child. The small metacentric marker chromosome was confirmed as i(18p) in the proband by cytogenetic and FISH analysis [47,XX+i(18p)]. Cytogenetic investigations in the family members revealed normal chromosome numbers, indicating the case as a de novo event of i(18p) formation. It could be due to the somewhat advanced maternal age (32 years) and/or expression of recessive genes in the proband, who is the progeny of consanguineous marriage, which could have led to misdivision and nondisjunction of chromosome 18 in meiosis I, followed by failure in the chromatid separation of 18p in meiosis II and by inverted duplication. [Abstract/Link to Full Text]

Sankar VH, Arya V, Tewari D, Gupta UR, Pradhan M, Agarwal S
Genotyping of alpha-thalassemia in microcytic hypochromic anemia patients from North India.
J Appl Genet. 2006;47(4):391-5.
Microcytic hypochromic anemia is a common condition in clinical practice and alpha-thalassemia has to be considered as a differential diagnosis. Molecular diagnosis of alpha-thalassemia is possible by polymerase chain reaction. The aim of this study was to evaluate the frequency of alpha-gene numbers in subjects with microcytosis. In total, 276 subjects with microcytic hypochromic anemia [MCV<80fl; MCH<27pg] were studied. These include 125 with thalassemia trait, 48 with thalassemia major, 26 with sickle-cell thalassemia, 15 with E beta-thalassemia, 40 with iron-deficiency anemia, 8 with another hemolytic anemia, and 14 patients with no definite diagnosis. Genotyping for -alpha3.7 deletion, -alpha4.2 deletion, Hb Constant Spring, and a-triplications was done with polymerase chain reaction. The overall frequency of -alpha3.7 deletion in 276 individuals is 12.7%. The calculated allele frequency for a-thalassemia is 0.09. The subgroup analysis showed that co-inheritance of a-deletion is more frequent with the sickle-cell mutation than in other groups. We were able to diagnose 1/3 of unexplained cases of microcytosis as a-thalassemia carriers. The a-gene mutation is quite common in the Indian subcontinent. Molecular genotyping of a-thalassemia helps to diagnose unexplained microcytosis, and thus prevents unnecessary iron supplementation. [Abstract/Link to Full Text]

Binczak-Kuleta A, Rozanski J, Domanski L, Myslak M, Ciechanowski K, Ciechanowicz A
DNA microsatellite analysis in families with autosomal dominant polycystic kidney disease (ADPKD): the first Polish study.
J Appl Genet. 2006;47(4):383-9.
BACKGROUND: Autosomal dominant polycystic kidney disease (ADPKD) is one of the most common inherited renal disorders with genetic heterogeneity. Mutations of two known genes are responsible for this disease: PKD1 at 16p13.3 and PKD2 at 4q21-23. A majority of cases (85%) are caused by mutations in PKD1. Because direct mutation screening remains complex, we describe here the application of an efficient approach to studies based on highly informative dinucleotide and tetranucleotide repeats flanking genes PKD1 and PKD2. METHODS: For this study a series of microsatellites closely linked to locus PKD1 (D16S291, D16S663, D16S665, D16S283, D16S407, D16S475) and to locus PKD2 (D4S1563, D4S2929, D4S414, D4S1534, D4S423) were selected. Short (81-242 bp) DNA fragments containing the tandem repeats were amplified by polymerase chain reaction (PCR). The number of repeat units of microsatelite markers was determined by fluorescent capillary electrophoresis. RESULTS: DNA microsatellite analysis was performed in 25 Polish ADPKD families and established the type of disease (21 families PKD1-type, 1 family PKD2-type). CONCLUSIONS: While a disease-causing mutation in the PKD1 and PKD2 genes cannot be identified, DNA microsatellite analysis provided an early diagnosis and may be considered in ADPKD families. [Abstract/Link to Full Text]

Urbina-Cano P, Bobadilla-Morales L, Ram�rez-Herrera MA, Corona-Rivera JR, Mendoza-Maga�a ML, Troyo-Sanrom�n R, Corona-Rivera A
DNA damage in mouse lymphocytes exposed to curcumin and copper.
J Appl Genet. 2006;47(4):377-82.
Dietary polyphenolics, such as curcumin, have shown antioxidant and anti-inflammatory effects. Some antioxidants cause DNA strand breaks in excess of transition metal ions, such as copper. The aim of this study was to evaluate the in vitro effect of curcumin in the presence of increasing concentrations of copper to induce DNA damage in murine leukocytes by the comet assay. Balb-C mouse lymphocytes were exposed to 50 microM curcumin and various concentrations of copper (10 microM, 100 microM and 200 microM). Cellular DNA damage was detected by means of the alkaline comet assay. Our results show that 50 microM curcumin in the presence of 100-200 microM copper induced DNA damage in murine lymphocytes. Curcumin did not inhibit the oxidative DNA damage caused by 50 microM H2O2 in mouse lymphocytes. Moreover, 50 microM curcumin alone was capable of inducing DNA strand breaks under the tested conditions. The increased DNA damage by 50 mM curcumin was observed in the presence of various concentrations of copper, as detected by the alkaline comet assay. [Abstract/Link to Full Text]

Recent Articles in Genetics and Molecular Research

Arruda JT, Bordin BM, Santos PR, Mesquita WE, Silva RC, Maia MC, Approbato MS, Flor�ncio RS, Amaral WN, Rocha Filho MA, Moura KK
Y chromosome microdeletions in Brazilian fertility clinic patients.
Genet Mol Res. 2007;6(2):461-9.
Microdeletions in Yq are associated with defects in spermatogenesis, while those in the AZF region are considered critical for germ cell development. We examined microdeletions in the Y chromosomes of patients attended at the Laboratory of Human Reproduction of the Clinical Hospital of the Federal University of Goi�s as part of a screening of patients who plan to undergo assisted reproduction. Analysis was made of the AZF region of the Y chromosome in men who had altered spermograms to detect possible microdeletions in Yq. Twenty-three patients with azoospermia and 40 with severe oligozoospermia were analyzed by PCR for the detection of six sequence-tagged sites: sY84 and sY86 for AZFa, sY127 and sY134 for AZFb, and sY254 and sY255 for AZFc. Microdeletions were detected in 28 patients, including 10 azoospermics and 18 severe oligozoospermics. The patients with azoospermia had 43.4% of their microdeletions in the AZFa region, 8.6% in the AZFb region and 17.4% in the AZFc region. In the severe oligozoospermics, 40% were in the AZFa region, 5% in the AZFb region and 5% in the AZFc region. We conclude that microdeletions can be the cause of idiopathic male infertility, supporting conclusions from previous studies. [Abstract/Link to Full Text]

Ondei LS, Zamaro PJ, Mangonaro PH, Val�ncio CR, Bonini-Domingos CR
HPLC determination of hemoglobins to establish reference values with the aid of statistics and informatics.
Genet Mol Res. 2007;6(2):453-60.
The purpose of the present study was to establish reference values for hemoglobins (Hb) using HPLC, in samples containing normal Hb (AA), sickle cell trait without alpha-thalassemia (AS), sickle cell trait with alpha-thalassemia (ASH), sickle cell anemia (SS), and Hb SC disease (SC). The blood samples were analyzed by electrophoresis, HPLC and molecular procedures. The Hb A2 mean was 4.30 +/- 0.44% in AS, 4.18 +/- 0.42% in ASH, 3.90 +/- 1.14% in SS, and 4.39 +/- 0.35% in SC. They were similar, but above the normal range. Between the AS and ASH groups, only the amount of Hb S was higher in the AS group. The Hb S mean in the AS group was 38.54 +/- 3.01% and in the ASH it was 36.54 +/- 3.76%. In the qualitative analysis, using FastMap, distinct groups were seen: AA and SS located at opposite extremes, AS and ASH with overlapping values and intermediate distribution, SC between heterozygotes and the SS group. Hb S was confirmed by allele-specific polymerase chain reaction. The Hb values established will be available for use as a reference for the Brazilian population, drawing attention to the increased levels of Hb A2, which should be considered with caution to prevent incorrect diagnoses. [Abstract/Link to Full Text]

Abud S, de Souza PI, Vianna GR, Leonardecz E, Moreira CT, Faleiro FG, J�nior JN, Monteiro PM, Rech EL, Arag�o FJ
Gene flow from transgenic to nontransgenic soybean plants in the Cerrado region of Brazil.
Genet Mol Res. 2007;6(2):445-52.
Evaluation of transgenic crops under field conditions is a fundamental step for the production of genetically engineered varieties. In order to determine if there is pollen dispersal from transgenic to nontransgenic soybean plants, a field release experiment was conducted in the Cerrado region of Brazil. Nontransgenic plants were cultivated in plots surrounding Roundup Ready transgenic plants carrying the cp4 epsps gene, which confers herbicide tolerance against glyphosate herbicide, and pollen dispersal was evaluated by checking for the dominant gene. The percentage of cross-pollination was calculated as a fraction of herbicide-tolerant and -nontolerant plants. The greatest amount of transgenic pollen dispersion was observed in the first row, located at one meter from the central (transgenic) plot, with a 0.52% average frequency. The frequency of pollen dispersion decreased to 0.12% in row 2, reaching 0% when the plants were up to 10 m distance from the central plot. Under these conditions pollen flow was higher for a short distance. This fact suggests that the management necessary to avoid cross-pollination from transgenic to nontransgenic plants in the seed production fields should be similar to the procedures currently utilized to produce commercial seeds. [Abstract/Link to Full Text]

DeGroot BJ, Keown JF, Van Vleck LD, Kachman SD
Estimates of genetic parameters for Holstein cows for test-day yield traits with a random regression cubic spline model.
Genet Mol Res. 2007;6(2):434-44.
Genetic parameters were estimated with restricted maximum likelihood for individual test-day milk, fat, and protein yields and somatic cell scores with a random regression cubic spline model. Test-day records of Holstein cows that calved from 1994 through early 1999 were obtained from Dairy Records Management Systems in Raleigh, North Carolina, for the analysis. Estimates of heritability for individual test-days and estimates of genetic and phenotypic correlations between test-days were obtained from estimates of variances and covariances from the cubic spline analysis. Estimates were calculated of genetic parameters for the averages of the test days within each of the ten 30-day test intervals. The model included herd test-day, age at first calving, and bovine somatropin treatment as fixed factors. Cubic splines were fitted for the overall lactation curve and for random additive genetic and permanent environmental effects, with five predetermined knots or four intervals between days 0, 50, 135, 220, and 305. Estimates of heritability for lactation one ranged from 0.10 to 0.15, 0.06 to 0.10, 0.09 to 0.15, and 0.02 to 0.06 for test-day one to test-day 10 for milk, fat, and protein yields and somatic cell scores, respectively. Estimates of heritability were greater in lactations two and three. Estimates of heritability increased over the course of the lactation. Estimates of genetic and phenotypic correlations were smaller for test-days further apart. [Abstract/Link to Full Text]

Mazz� FM, Fuzo CA, Ciancaglini P, Degr�ve L
Recognition of alpha-helix transmembrane domains with an amphipathy scale generated by molecular dynamics using only the primary sequence of proteins.
Genet Mol Res. 2007;6(2):422-33.
We recently developed an amphipathy scale, elaborated from molecular dynamics data that can be used for the identification of hydrophobic or hydrophilic regions in proteins. This amphipathy scale reflects side chain/water molecule interaction energies. We have now used this amphipathy scale to find candidates for transmembrane segments, by examining a large sample of membrane proteins with alpha-helix segments. The candidates were selected based on an amphipathy coefficient value range and the minimum number of residues in a segment. We compared our results with the transmembrane segments previously identified in the PDB_TM database by the TMDET algorithm. We expected that the hydrophobic segments would be identified using only the primary structures of the proteins and the amphipathy scale. However, some of these hydrophobic segments may pertain to hydrophobic pockets not included in transmembrane regions. We found that our amphipathy scale could identify alpha-helix transmembrane regions with a probability of success of 76% when all segments were included and 90% when all membrane proteins were included. [Abstract/Link to Full Text]

Bonini-Domingos CR, Silva MB, Romero RM, Zamaro PJ, Ondei LS, Zago CE, Moreira SB, Salgado CG
Description of electrophoretic and chromatographic hemoglobin profile of Rhinoclemmys punctularia.
Genet Mol Res. 2007;6(2):415-21.
Studies of the hemoglobin pattern in Brazilian reptiles are important for determining ecological and phylogenetic relationships, but they are scarce. Peripheral blood samples were obtained from 7 males and 18 females of Rhinoclemmys punctularia. The hematological profile was based on the total hemoglobin and hematocrit values. The hemoglobin profile was obtained using electrophoretic procedures at different pH, isoelectric focusing, globin chain electrophoresis, and HPLC. The hematocrit (31 +/- 2%) and total hemoglobin (7.5 +/- 0.2 g/dL) values did not indicate gender variations. Alkaline pH electrophoresis of the total blood samples treated with 1% saponin demonstrated the presence of four well-defined hemoglobin fractions, one major component (fraction I), showing cathodic migration and three others faster than fraction I with anodic migration. When the samples were precipitated with chloroform, only two hemoglobin fractions were observed, similar to fractions I and III from the first procedure. Isoelectric focusing and HPLC showed the same pattern. With acid and neutral pH electrophoresis, two fractions with anodic migration were observed. The globin chain identification at alkaline pH showed two fractions, but four fractions were observed at acidic pH, suggesting that different polypeptide chains are involved in the hemoglobin molecule. The chromatographic separation of the total blood sample demonstrated that the major fraction comprised 81.9% and the minor 18.1%. The results obtained demonstrated a similarity between these hemoglobin components and those of some Chelidae reported in the literature for both land and aquatic animals, reflecting the adaptation to environmental conditions. [Abstract/Link to Full Text]

Ferreira RC, Bosco F, Paiva PB, Briones MR
Minimization of transcriptional temporal noise and scale invariance in the yeast genome.
Genet Mol Res. 2007;6(2):297-314.
The analysis of transcriptional temporal noise could be an interesting means to study gene expression dynamics and stochasticity in eukaryotes. To study the statistical distributions of temporal noise in the eukaryotic model system Saccharomyces cerevisiae, we analyzed microarray data corresponding to one cell cycle for 6200 genes. We found that the temporal noise follows a lognormal distribution with scale invariance at the genome, chromosomal and sub-chromosomal levels. Correlation of temporal noise with the codon adaptation index suggests that at least 70% of all protein-coding genes are a noise minimization core of the genome. Accordingly, a mathematical model of individual gene expression dynamics was proposed, using an operator theoretical approach, which reveals strict conditions for noise variability and a possible global noise minimization/optimization strategy at the genome level. Our model and data show that minimal noise does not correspond to genes obeying a strictly deterministic dynamics. The natural strategy of minimization consists in equating the mean of the absolute value of the relative variation of the expression level (alpha) with noise (eta). We hypothesize that the temporal noise pattern is an emergent property of the genome and shows how the dynamics of gene expression could be related to chromosomal organization. [Abstract/Link to Full Text]

Tannure-Nascimento IC, Nascimento FS, Turatti IC, Lopes NP, Trigo JR, Zucchi R
Colony membership is reflected by variations in cuticular hydrocarbon profile in a Neotropical paper wasp, Polistes satan (Hymenoptera, Vespidae).
Genet Mol Res. 2007;6(2):290-6.
Nestmate recognition is one the most important features in social insect colonies. Although epicuticular lipids or cuticular hydrocarbons have both structural and defensive functions in insects, they also seem to be involved in several aspects of communication in wasps, bees and ants. We analyzed and described for the first time the cuticular hydrocarbons of a Neotropical paper wasp, Polistes satan, and found that variation in hydrocarbon profile was sufficiently strong to discriminate individuals according to their colony membership. Therefore, it seems that small differences in the proportion of these compounds can be detected and used as a chemical-based cue by nestmates to detect invaders and avoid usurpation. [Abstract/Link to Full Text]

Grisolia AB, Moreno VR, Campagnari F, Milazzotto MP, Garcia JF, Adania CH, Souza EB
Genetic diversity of microsatellite loci in Leopardus pardalis, Leopardus wiedii and Leopardus tigrinus.
Genet Mol Res. 2007;6(2):282-9.
The microsatellite loci FCA045, FCA077, FCA008, and FCA096 are highly variable molecular markers which were used to determine the genetic diversity in 148 captive Leopardus sp. The PCR-amplified products of microsatellite loci were characterized in ABI Prism 310 Genetic Analyzer. Allele numbers, heterozygosity, polymorphism information content, exclusive allele number, and shared alleles were calculated. Sixty-five alleles were found and their sizes ranged from 116 to 216 bp in four microsatellite loci. The heterozygosity ranged from 0.36 to 0.81 in Leopardus pardalis, 0.57 to 0.67 in L. tigrinus and 0.80 to 0.92 in L. wiedii. The polymorphism information content was from 0.80 to 0.88 in L. pardalis, 0.76 to 0.88 in L. tigrinus and 0.77 to 0.90 in L. wiedii. The margay (L. wiedii) showed the highest index of polymorphism among the three species in this study. These results imply that microsatellite DNA markers can help in the study of the genetic diversity of Leopardus specimens. [Abstract/Link to Full Text]

Basualdo M, Rodr�guez EM, Bedascarrasbure E, De Jong D
Selection and estimation of the heritability of sunflower (Helianthus annuus) pollen collection behavior in Apis mellifera colonies.
Genet Mol Res. 2007;6(2):274-81.
We selected honey bee colonies (Apis mellifera L.) with a high tendency to collect sunflower pollen and estimated the heritability of this trait. The percentage of sunflower pollen collected by 74 colonies was evaluated. Five colonies that collected the highest percentages of sunflower pollen were selected. Nineteen colonies headed by daughters of these selected queens were evaluated for this characteristic in comparison with 20 control (unselected) colonies. The variation for the proportion of sunflower pollen was greater among colonies of the control group than among these selected daughter colonies. The estimated heritability was 0.26 +/- 0.23, demonstrating that selection to increase sunflower pollen collection is feasible. Such selected colonies could be used to improve sunflower pollination in commercial fields. [Abstract/Link to Full Text]

Hoenigsberg HF
From geochemistry and biochemistry to prebiotic evolution...we necessarily enter into G�nti's fluid automata.
Genet Mol Res. 2007;6(2):258-73.
The present study is just an overview of the opening of the geochemical stage for the appearance of life. But that opening would not have been sufficient for the intellectual discovery of the origin of life! The excellent works and many commendable efforts that advance this explanation have not shown the fundamental elements that participate in the theoretical frame of biological evolution. The latter imply the existence of evolutionary transitions and the production of new levels of organization. In this brief analysis we do not intend to introduce the audience to the philosophy of biology. But we do expect to provide a modest overview, in which the geochemical chemolithoautotrophic opening of the stage should be seen, at most, as the initial metabolism that enabled organic compounds to follow the road where a chemical fluid machinery was thus able to undertake the more "sublime" course of organic biological evolution. We think that Tibor G�nti's chemoton is the most significant contribution to theoretical biology, and the only course now available to comprehend the unit of evolution problem without the structuralist and functionalist conflict prevalent in theoretical biology. In our opinion G�nti's chemoton theory travels to the "locus" where evolutionary theory dares to extend itself to entities at many levels of structural organization, beyond the gene or the group above. Therefore, in this and subsequent papers on the prebiotic conditions for the eventual appearance of the genetic code, we explore the formation and the presence of metal sulfide minerals, from the assembly of metal sulfide clusters through the precipitation of nanocrystals and the further reactions resulting in bulk metal sulfide phases. We endeavor to characterize pristine reactions and the modern surfaces, utilizing traditional surface science techniques and computational methods. Moreover, mechanistic details of the overall oxidation of metal sulfide minerals are set forth. We hope that this paper will lead our audience to accept that in a chemically oscillating system the chemoton is a model fluid state automaton capable of growth and self-reproduction. This is not simply a matter of transmitting a pattern, as in inorganic crystals; such self-reproduction must be more complex than crystal growth. Indeed that is what G�nti's theoretical and abstract model offers to us all: we finally have a philosophy of evolutionary units in theoretical biology. [Abstract/Link to Full Text]

Salim DC, Akimoto AA, Carvalho CB, Oliveira SF, Grisolia CK, Moreira JR, Klautau-Guimar�es MN
Genetic variability in maned wolf based on heterologous short-tandem repeat markers from domestic dog.
Genet Mol Res. 2007;6(2):248-57.
The maned wolf (Chrysocyon brachyurus) is the largest South American canid. Habitat loss and fragmentation, due to agricultural expansion and predatory hunting, are the main threats to this species. It is included in the official list of threatened wildlife species in Brazil, and is also protected by IUCN and CITES. Highly variable genetic markers such as microsatellites have the potential to resolve genetic relationships at all levels of the population structure (among individuals, demes or metapopulations) and also to identify the evolutionary unit for strategies for the conservation of the species. Tests were carried out to verify whether a class of highly polymorphic tetranucleotide repeats described for the domestic dog effectively amplifies DNA in the maned wolf. All five loci studied were amplified; however, one of these, was shown to be monomorphic in 69 maned wolf samples. The average allele number and estimated heterozygosity per polymorphic locus were 4.3 and 67%, respectively. The genetic variability found for this species, which is considered threatened with extinction, showed similar results when compared to studies of other canids. [Abstract/Link to Full Text]

Mi�o CI, Del Lama SN
Genetic structure in Brazilian breeding colonies of the Roseate Spoonbill (Platalea ajaja, Aves: Threskiornithidae).
Genet Mol Res. 2007;6(2):238-47.
Roseate Spoonbills (Platalea ajaja, Linnaeus) are wading birds present in two of the most important Brazilian wetlands: the Pantanal wetlands and Rio Grande do Sul marshes. Natural populations of these species have not been previously studied with variable nuclear molecular markers. In order to support decision making regarding the management and conservation of these populations, we estimated and characterized the distribution of genetic variability among five Brazilian breeding colonies. The average observed heterozygosity in Brazilian Roseate Spoonbill populations (Ho = 0.575) did not differ significantly from the value determined in a U.S. wild-caught sample of 15 individuals, using data generated by the same set of microsatellite loci. Considering that the U.S. population underwent a recent reduction in size, we discuss this result supposing that the U.S. population was not genetically affected or that both populations had suffered a bottleneck. Global F(ST) indicated the lack of genetic differentiation among colonies, indicating the occurrence of past and/or present gene flow among them. Analysis of molecular variance revealed that most of the genetic variation is distributed within the colonies. Results are explained by a recent origin of colonies or by high levels of gene flow. Management decisions should take into consideration the fact that, even in the presence of high genetic exchange, ecological adaptations to different environments are important for species survival. [Abstract/Link to Full Text]

Stehling EG, Campos TA, Azevedo V, Brocchi M, Silveira WD
DNA sequencing of a pathogenicity-related plasmid of an avian septicemic Escherichia coli strain.
Genet Mol Res. 2007;6(2):231-7.
A 43-MDa conjugative plasmid isolated from an avian septicemic Escherichia coli (APEC) strain possessing genes related to the adhesion and invasion capacities of in vitro-cultured cells was sequenced. The results demonstrated that the 43-MDa plasmid harbors bacterial pathogenicity-related sequences which probably allow the wild-type pathogenic strain to adhere to and invade tissues and to cause septicemia in poultry. The existence of homology sequences to sequences belonging to other human pathogenic Enterobacteriaceae like Escherichia coli O157:H7, Shigella and Salmonella was also observed. The presence of these sequences in this plasmid could indicate that there is horizontal genetic transfer between bacterial strains isolated from different host species. In conclusion, the present study suggests that APEC strains harbor high-molecular weight plasmids that present pathogenicity-related sequences and that these are probably responsible for the pathogenicity exhibited by these strains. The presence of human pathogenicity-associated sequences in APEC conjugative plasmids suggests that these strains could represent a zoonotic risk. [Abstract/Link to Full Text]

Leite KC, Collevatti RG, Menegasso TR, Tomas WM, Duarte JM
Transferability of microsatellite loci from Cervidae species to the endangered Brazilian marsh deer, Blastocerus dichotomus.
Genet Mol Res. 2007;6(2):225-30.
Blastocerus dichotomus, the marsh deer, is the largest Brazilian Cervidae species. The species is endangered because of hunting and loss of its natural habitat, i.e., flood plain areas, because of hydroelectric power station construction and agricultural land expansion. In the present study, we tested 38 microsatellite loci from four Cervidae species: Odocoileus virginianus (7), Rangifer tarandus (17), Capreolus capreolus (7), and Mazama bororo (7). Eleven loci showed clear amplification, opening a new perspective for the generation of fundamental population genetic data for devising conservation strategies for B. dichotomus. [Abstract/Link to Full Text]

Lins TC, Nogueira LR, Lima RM, Gentil P, Oliveira RJ, Pereira RW
A multiplex single-base extension protocol for genotyping Cdx2, FokI, BsmI, ApaI, and TaqI polymorphisms of the vitamin D receptor gene.
Genet Mol Res. 2007;6(2):216-24.
The well-described role of the vitamin D endocrine system in bone metabolism makes its receptor a widely investigated candidate gene in association studies looking for the genetic basis of complex bone-related phenotypes. Most association studies genotype five polymorphic sites along the gene using PCR-RFLP and allele-specific amplification methods, which may not be the better choice in large case/control or cross-sectional studies. In this case, genotyping SNPs in parallel and using automated allele-calling methods are important to decrease genotyping errors due to manual data handling and save sample in cases where the amount of DNA is limited. The aim of this study was to present a straightforward method based on multiplex PCR amplification followed by multiplex single-base extension as a simple way to genotype five vitamin D receptor gene polymorphisms in parallel, which may be implemented in medium- to large-scale case/control or cross-sectional studies. The results regarding method feasibility and optimization are presented by genotyping eight paternity trios and seven samples of Brazilian postmenopausal women who took part in an ongoing association study carried out by members of our group. [Abstract/Link to Full Text]

Fuzinatto VA, Pagliarini MS, Valle CB
Evidence of programmed cell death during microsporogenesis in an interspecific Brachiaria (Poaceae: Panicoideae: Paniceae) hybrid.
Genet Mol Res. 2007;6(2):208-15.
Morphological changes have been investigated during plant programmed cell death (PCD) in the last few years due to the new interest in a possible apoptotic-like phenomenon existing in plants. Although PCD has been reported in several tissues and specialized cells in plants, there have been few reports of its occurrence during microsporogenesis. The present study reports a typical process of PCD during meiosis in an interspecific Brachiaria hybrid leading to male sterility. In this hybrid, some inflorescences initiated meiosis but it was arrested in zygotene/pachytene. From this stage, meiocytes underwent a severe alteration in shape showing substantial membrane blebbing; the cytoplasm became denser at the periphery; the cell nucleus entered a progressive stage of chromatin disintegration, and then the nucleolus disintegrated, and the cytoplasm condensed and shrunk. The oldest flowers of the raceme showed only the callose wall in the anthers showing obvious signs of complete sterility. [Abstract/Link to Full Text]

P�rez IA, Santana SP, Argudin TD, Gardon DO
Analysis of blood processing conditions to obtain high-quality total RNA from human leukocyte concentrate.
Genet Mol Res. 2007;6(2):198-207.
Blood samples are used as a biological source to discover biomarkers of hematological and non-hematological disorders. The present study shows the impact of different experimental conditions associated with cell lysis buffer, TRI-reagent protocol and blood cell storage buffer and their correlation with the quantity, quality and Adrenomedullin gene expression levels of total RNA when RT-PCR technique is used. A leukocyte cell bank protocol is also proposed for further mRNA expression analysis using RNAlater as storage buffer. There is evidence that total RNA isolated from leukocyte concentrate stored for 1 month at -70 degrees C did not show significant differences concerning quality, purity and Adrenomedullin gene expression compared with the freshly processed leukocyte sample. [Abstract/Link to Full Text]

Nassar NM, Sousa MV
Amino acid profile in cassava and its interspecific hybrid.
Genet Mol Res. 2007;6(2):192-7.
Cassava roots have a low-protein content (0.7-2%). Amino acids such as lysine and methionine are also low, and some research reports have indicated the absence of methionine. The amino acid profiles of a common cassava cultivar and an interspecific hybrid, namely ICB 300, were determined using the computerized amino acid analyzer Hitachi L-8500. The interspecific hybrid has 10 times more lysine and 3 times more methionine than the common cassava cultivar: lysine content was 0.010 g per 100 g in the common cassava cultivar while it reached 0.098 in the interspecific hybrid. Methionine in the common cassava cultivar was 0.014 g per 100 g whereas it reached 0.041 g per 100 g in the interspecific hybrid. Total amino acid content in the common cassava cultivar was 0.254 g per 100 g viz. a viz. 1.664 g per 100 g in the interspecific hybrid. The genetic variability of the profile and quantity of amino acids indicate the feasibility of selecting interspecific hybrids that are rich in both crude protein and amino acids. This is the first report of high true protein in cassava root. [Abstract/Link to Full Text]

Das JK, Khuda-Bukhsh AR
Preponderance of GC-rich sites in silver-stained nucleolus organizing regions of Rita rita (Hamilton) and Mystus gulio (Hamilton) (Bagridae, Pisces), as revealed by chromomycin A3-staining technique and scanning electron microscopic studies.
Genet Mol Res. 2007;6(2):184-91.
The karyotypes of two species of catfish, Rita rita (Hamilton) (2n = 54; 14m + 34sm + 6st; NF = 102) and Mystus gulio (Hamilton) (2n = 58; 30m + 12sm + 2st + 14t, NF = 100) were studied through Giemsa-, silver- and chromomycin A(3)-staining techniques. The silver-stained karyotypes in both sexes of R. rita and M. gulio revealed that the nucleolus organizing regions were located terminally at the shorter arms (Tp) of one pair of submetacentric chromosomes, placed at positions Nos. 2 and 1, respectively, which was confirmed by scanning electron microscopy. Staining with a GC-specific fluorochrome, chromomycin A(3), produced bright fluorescence in the Ag-positive nucleolus organizer regions, suggesting thereby that nucleolus organizing regions actually included GC-rich sites of active r-RNA genes in metaphase chromosomes of these two bagrids. Further such studies are needed due to the extreme paucity of data on fish. [Abstract/Link to Full Text]

Calliari LE, Longui CA, Rocha MN, Faria CD, Kochi C, Melo MR, Melo MB, Monte O
A novel mutation in DAX1 gene causing different phenotypes in three siblings with adrenal hypoplasia congenita.
Genet Mol Res. 2007;6(2):177-83.
Adrenal hypoplasia congenita (AHC) is a rare disease that can be caused by many abnormalities, including an X-linked form. Mutations in the DAX1 gene have been assigned as the genetic cause of AHC. We describe here three siblings with AHC, clinically presented at different ages, two in the neonatal period and one oligosymptomatic during infancy. Molecular analysis was able to detect a novel mutation in exon 1 of the DAX1 gene, consisting of a transition of C to T at position 359, determining a stop codon at position 359 (Q359X). The mutated gene encodes a truncated protein missing a large portion of the ligand-binding domain (C-terminal domain). The recognition of the disease in the index case suggested the diagnosis in the other siblings. Interestingly, the same mutation is presented with different phenotypes, suggesting that first-degree family members of patients with DAX1 mutations should be carefully evaluated routinely. [Abstract/Link to Full Text]

Campos SR, Rieger TT, Santos JF
Homology of polytene elements between Drosophila and Zaprionus determined by in situ hybridization in Zaprionus indianus.
Genet Mol Res. 2007;6(2):162-76.
The drosophilid Zaprionus indianus due to its economical importance as an insect pest in Brazil deserves more investigation into its genetics. Its mitotic karyotype and a line-drawing map of its polytene chromosomes are already available. This paper presents a photomap of Z. indianus polytene chromosomes, which was used as the reference map for identification of sections marked by in situ hybridization with gene probes. Hybridization signals for Hsp70 and Hsr-omega were detected, respectively, in sections 34B and 32C of chromosome V of Z. indianus, which indicates its homology to the chromosomal arm 3R of Drosophila melanogaster and, therefore, to Muller's element E. The main signal for Hsp83 gene probe hybridization was in section 17C of Z. indianus chromosome III, suggesting its homology to arm 3L of D. melanogaster and to element D of Muller. The Ubi probe hybridized in sections 10C of chromosome II and 17A of chromosome III. Probably the 17A is the polyubiquitin locus, with homology to arm 3L of D. melanogaster and to the mullerian D element, as suggested also by Hsp83 gene location. The Br-C gene was mapped in section 1D, near the tip of the X chromosome, indicating its homology to the X chromosome of D. melanogaster and to mullerian element A. The Dpp gene probe hybridized mainly in the section 32A of chromosome V and, at lower frequencies to other sections, although no signal was observed as expected in the correspondent mullerian B element. This result led to the suggestion of a rearrangement including the Dpp locus in Z. indianus, the secondary signals possibly pointing to related genes of the TGF-beta family. In conclusion, the results indicate that chromosomes X, III, V of Z. indianus are respectively correspondents to elements A, D, and E of Muller. At least chromosome V of Z. indianus seems to share synteny with the 3R arm of D. melanogaster, as indicated by the relative positions of Hsp70 and Hsr-omega, although the Dpp gene indicates a disruption of synteny in its distal region. [Abstract/Link to Full Text]

Gon�alves VF, Prosdocimi F, Santos LS, Ortega JM, Pena SD
Sex-biased gene flow in African Americans but not in American Caucasians.
Genet Mol Res. 2007;6(2):156-61.
We have previously shown evidence of strong sex-biased genetic blending in the founding and ongoing history of the Brazilian population, with the African and Amerindian contribution being highest from maternal lineages (as measured by mitochondrial DNA) and the European contribution foremost from paternal lineages (estimated from Y-chromosome haplogroups). The same phenomenon has been observed in several other Latin American countries, suggesting that it might constitute a universal characteristic of the Iberian colonization of the Americas. However, it has also recently been detected in the Black population of the United States. We thus wondered if the same could be observed in American Caucasians. To answer that question, we retrieved 1387 hypervariable I Caucasian mitochondrial DNA sequences from the FBI population database and established their haplogroups and continental geographical sources. In sharp contrast with the situation of the Caucasian population of Latin American countries, only 3.1% of the American Caucasian sequences had African and/or Amerindian origin. To explain this discrepancy we propose that the finding of elevated genomic contributions from European males and Amerindian or African females depends not only on the occurrence of directional mating, but also on the "racial" categorization of the children born from these relations. In this respect, social practices in Latin America and in the United States diverge considerably; in the former socially significant "races" are normally designated according to physical appearance, while in the latter descent appears to be the most important factor. [Abstract/Link to Full Text]

Lopes DO, Regis-da-Silva CG, Machado-Silva A, Macedo AM, Franco GR, Hoffmann JS, Cazaux C, Pena SD, Teixeira SM, Machado CR
Analysis of DNA polymerase activity in vitro using non-radioactive primer extension assay in an automated DNA sequencer.
Genet Mol Res. 2007;6(2):150-5.
Although different DNA polymerases have distinct functions and substrate affinities, their general mechanism of action is similar. Thus, they can all be studied using the same technical principle, the primer extension assay employing radioactive tags. Even though fluorescence has been used routinely for many years for DNA sequencing, it has not been used in the in vitro primer extension assay. The use of fluorescence labels has obvious advantages over radioactivity, including safety, speed and ease of manipulation. In the present study, we demonstrated the potential of non-radioactive in vitro primer extension for DNA polymerase studies. By using an M13 tag in the substrate, we can use the same fluorescent M13 primer to study different substrate sequences. This technique allows quantification of the DNA polymerase activity of the Klenow fragment using different templates and under different conditions with similar sensitivity to the radioactive assay. [Abstract/Link to Full Text]

Thomas MG, Enns RM, Shirley KL, Garcia MD, Garrett AJ, Silver GA
Associations of DNA polymorphisms in growth hormone and its transcriptional regulators with growth and carcass traits in two populations of Brangus bulls.
Genet Mol Res. 2007;6(1):222-37.
Sequence polymorphisms in the growth hormone (GH) gene and its transcriptional regulators, Pit-1 and Prop-1, were evaluated for associations with growth and carcass traits in two populations of Brangus bulls Chihuahuan Desert Rangeland Research Center (CDRRC, N = 248 from 14 sires) and a cooperating breeding program (COOP, N = 186 from 34 sires). Polymorphisms were SNP mutations in intron 4 (C/T) and exon V (C/G) in GH, A/G in exon VI in Pit-1, and A/G in exon III in Prop-1. In the COOP population, bulls of Pit-1 GG genotype had a significantly greater percentage of intramuscular fat than bulls of the AA or AG genotype, and bulls of the Prop-1 AA genotype had significantly greater scrotal circumference than bulls of AG or GG genotypes at ~365 days of age. Also, heterozygous genotypes for the two GH polymorphisms appeared advantageous for traits of muscularity and adiposity in the COOP population. The heterozygous genotype of GH intron 4 SNP was associated with advantages in weight gain, scrotal circumference, and fat thickness in the CDRRC population. The two GH polymorphisms accounted for >/=27.7% of the variation in these traits in the CDRRC population; however, R(2) was <5% in the COOP population. Based on haplotype analyses the two GH SNPs appeared to be in phase; the haplotype analyses also paralleled with the genotype analyses. Polymorphisms in GH and its transcriptional regulators appear to be predictors of growth and carcass traits in Brangus bulls, particularly those with heterozygous GH genotypes. [Abstract/Link to Full Text]

Jos� AA, Gama MA, Urban A, Merighe GK, Meirelles FV, Etchegaray MA, Lanna DP
Evaluation of polyvinyl alcohol for fatty acid supplementation in adipose tissue explant culture.
Genet Mol Res. 2007;6(1):214-21.
Cultures of adipose tissue explants are a valuable tool for studying the intracellular mechanisms involving hormones and nutrients. However, testing how fatty acids affect cells requires a carrier molecule; bovine serum albumin (BSA) has been used for this purpose. However, contaminants can alter the cellular response. Our objectives were to: 1) test BSA as a fatty acid carrier and 2) evaluate polyvinyl alcohol (PVA) as a replacement for BSA. Adipose tissue explants from nine pigs were cultured in medium 199 for 4, 12, 24, and 48 h, with the following treatments: control, PVA (100 mM PVA added) and PVA + pGH (100 mM PVA plus 0.1 mg/mL porcine growth hormone). After each culture period, explants were collected and assayed for lipogenesis. After 48 h in culture, explants were assayed for lipolysis. A preliminary study with different commercial sources and high concentrations showed that BSA affected lipogenic rates. On the other hand, there were no effects of PVA on lipid synthesis, while pGH (positive control) reduced glucose incorporation into lipids (P < 0.01) when compared to both control and PVA (P < 0.05). There was no difference between control and PVA for lipolysis rates. However, pGH increased lipolysis when compared to control (P < 0.01) and PVA (P < 0.05). We demonstrated that BSA can alter lipogenesis, which precludes its use as a carrier molecule. On the other hand, addition of PVA had no effect on lipolysis or lipogenesis. We suggest the use of PVA instead of BSA for adding bioactive fatty acids to cultures of adipose tissue. [Abstract/Link to Full Text]

Allen ML
Expressed sequenced tags from Lygus lineolaris (Hemiptera: Miridae), the tarnished plant bug.
Genet Mol Res. 2007;6(1):206-13.
Expressed sequenced tags (ESTs) were prepared to establish a baseline for molecular genetic studies of the tarnished plant bug, Lygus lineolaris (Palisot de Beauvois). The largest class of identifiable ESTs (15.2%) was from genes involved in cellular metabolic functions, including physiological processes. Twenty-seven ESTs (9.8%) were from genes associated with transcription and translation, including ribosomal genes. One hundred and forty-two of the 276 unique ESTs were from genes not previously identified from any organism. Twelve sequences appear to be associated with feeding and digestion and may be targets for pest control studies. [Abstract/Link to Full Text]

Anh� AC, Lima-Oliveira AP, Azeredo-Oliveira MT
Acid phosphatase activity distribution in salivary glands of triatomines (Heteroptera, Reduviidae, Triatominae).
Genet Mol Res. 2007;6(1):197-205.
Acid phosphatase activity (G�mori technique) in salivary gland cells was investigated in adult insects (males and females) of four species of triatomines: Triatoma infestans, Panstrongylus megistus, Rhodnius neglectus, and Rhodnius prolixus. Binucleated cells with bulky and polyploidy nuclei were detected, with acid phosphatase activity in the heterochromatin and nucleolus, which showed the most intense response. Thus, the activity of these phosphatases during rRNA molecule transcription, possibly in the nucleolar fibrillar center, is suggested. The difference in reactivity found among salivary glands is associated with the cellular metabolism of these regions and, probably, with the biosynthesis of their different secretions. This must be essential in maintaining the hematophagy of triatomines. [Abstract/Link to Full Text]

Horimoto AR, Ferraz JB, Balieiro JC, Eler JP
Phenotypic and genetic correlations for body structure scores (frame) with productive traits and index for CEIP classification in Nellore beef cattle.
Genet Mol Res. 2007;6(1):188-96.
The present study was carried out to estimate both (co)variance components and genetic parameters for frame scores obtained using two methods (FRAME_GMA and FRAME_BIF) as well as phenotypic and genetic correlations with traits such as weaning weight, weight gain from weaning to yearling, scrotal circumference, muscle score, and an empiric index for animal classification for the Special Certificate of Identification and Production (CEIP). Data on 12,728 animals, raised in Southeastern Brazil, with ages from 490 to 610 days were analyzed. Estimates of heritability for FRAME_GMA and FRAME_BIF in multi-trait analysis were 0.28 and 0.24, respectively. Genetic correlation coefficients between frame scores and the growth trait were of medium magnitude, which indicates that genetic selection for weight resulted in undesirable responses, increasing the animals' frames. Small changes should be expected in the frame of animals that have been submitted to a genetic selection regarding muscle score and scrotal circumference. The low magnitude of phenotypic and genetic correlation between frame scores and the empirical selection index that classifies animals for CEIP, a Brazilian official certificate that recognizes the value of seedstock that is not registered at breeders associations, but is genetically evaluated, does not indicate important responses in giving a CEIP to animals that have been directly or indirectly selected for frame. Other studies must be performed to determine estimates of the genetic parameters for frame scores in other beef cattle populations. [Abstract/Link to Full Text]

Ribeiro RA, Lovato MB
Comparative analysis of different DNA extraction protocols in fresh and herbarium specimens of the genus Dalbergia.
Genet Mol Res. 2007;6(1):173-87.
Five published DNA extraction protocols were compared for their ability to produce good quality DNA from fresh and herbarium leaves of several species of the genus Dalbergia. The leaves of these species contain high amounts of secondary metabolites, which make it difficult to perform a clean DNA extraction and thereby interfering with subsequent PCR amplification. The protocol that produced the best DNA quality in most of the Dalbergia species analyzed, utilizes polyvinylpyrrolidone to bind the phenolic compounds, a high molar concentration of NaCl to inhibit co-precipitation of polysaccharides and DNA, and LiCl for removing RNA by selective precipitation. The DNA quality of herbarium specimens was worse than that for fresh leaves, due to collecting conditions and preservation of samples. We analyzed 54 herbarium specimens, but the recovered DNA allowed successful PCR amplification in only eight. For the genus Dalbergia, the herbarium is an important source of material for phylogenetic and evolutionary studies; due to the occurrence of the different species in various geographical regions in Brazil, it is difficult to obtain fresh material in nature. Our results demonstrated that for Dalbergia species the methods used for the collection and preservation of herbarium specimens have a mayor influence on DNA quality and in the success of phylogenetic studies of the species. [Abstract/Link to Full Text]