Free Full Text Journal Articles: Genetics and Proteomics -- Neurotransmitter.net

Recent Articles in Nucleic Acids Research

Bourdeau V, Desch�nes J, Laperri�re D, Aid M, White JH, Mader S
Mechanisms of primary and secondary estrogen target gene regulation in breast cancer cells.
Nucleic Acids Res. 2007 Nov 5; .
Estrogen receptors (ERs), which mediate the proliferative action of estrogens in breast cancer cells, are ligand-dependent transcription factors that regulate expression of their primary target genes through several mechanisms. In addition to direct binding to cognate DNA sequences, ERs can be recruited to DNA through other transcription factors (tethering), or affect gene transcription through modulation of signaling cascades by non-genomic mechanisms of action. To better characterize the mechanisms of gene regulation by estrogens, we have identified more than 700 putative primary and about 1300 putative secondary target genes of estradiol in MCF-7 cells through microarray analysis performed in the presence or absence of the translation inhibitor cycloheximide. Although siRNA-mediated inhibition of ERalpha expression antagonized the effects of estradiol on up- and down-regulated primary target genes, estrogen response elements (EREs) were enriched only in the vicinity of up-regulated genes. Binding sites for several other transcription factors, including proteins known to tether ERalpha, were enriched in up- and/or down-regulated primary targets. Secondary estrogen targets were particularly enriched in sites for E2F family members, several of which were transcriptionally regulated by estradiol, consistent with a major role of these factors in mediating the effects of estrogens on gene expression and cellular growth. [Abstract/Link to Full Text]

Mangone M, Macmenamin P, Zegar C, Piano F, Gunsalus KC
UTRome.org: a platform for 3'UTR biology in C. elegans.
Nucleic Acids Res. 2007 Nov 22;
Three-prime untranslated regions (3'UTRs) are widely recognized as important post-transcriptional regulatory regions of mRNAs. RNA-binding proteins and small non-coding RNAs such as microRNAs (miRNAs) bind to functional elements within 3'UTRs to influence mRNA stability, translation and localization. These interactions play many important roles in development, metabolism and disease. However, even in the most well-annotated metazoan genomes, 3'UTRs and their functional elements are not well defined. Comprehensive and accurate genome-wide annotation of 3'UTRs and their functional elements is thus critical. We have developed an open-access database, available at http://www.UTRome.org, to provide a rich and comprehensive resource for 3'UTR biology in the well-characterized, experimentally tractable model system Caenorhabditis elegans. UTRome.org combines data from public repositories and a large-scale effort we are undertaking to characterize 3'UTRs and their functional elements in C. elegans, including 3'UTR sequences, graphical displays, predicted and validated functional elements, secondary structure predictions and detailed data from our cloning pipeline. UTRome.org will grow substantially over time to encompass individual 3'UTR isoforms for the majority of genes, new and revised functional elements, and in vivo data on 3'UTR function as they become available. The UTRome database thus represents a powerful tool to better understand the biology of 3'UTRs. [Abstract/Link to Full Text]

Okuno Y, Tamon A, Yabuuchi H, Niijima S, Minowa Y, Tonomura K, Kunimoto R, Feng C
GLIDA: GPCR ligand database for chemical genomics drug discovery database and tools update.
Nucleic Acids Res. 2007 Nov 5;
G-protein coupled receptors (GPCRs) represent one of the most important families of drug targets in pharmaceutical development. GLIDA is a public GPCR-related Chemical Genomics database that is primarily focused on the integration of information between GPCRs and their ligands. It provides interaction data between GPCRs and their ligands, along with chemical information on the ligands, as well as biological information regarding GPCRs. These data are connected with each other in a relational database, allowing users in the field of Chemical Genomics research to easily retrieve such information from either biological or chemical starting points. GLIDA includes a variety of similarity search functions for the GPCRs and for their ligands. Thus, GLIDA can provide correlation maps linking the searched homologous GPCRs (or ligands) with their ligands (or GPCRs). By analyzing the correlation patterns between GPCRs and ligands, we can gain more detailed knowledge about their conserved molecular recognition patterns and improve drug design efforts by focusing on inferred candidates for GPCR-specific drugs. This article provides a summary of the GLIDA database and user facilities, and describes recent improvements to database design, data contents, ligand classification programs, similarity search options and graphical interfaces. GLIDA is publicly available at http://pharminfo.pharm.kyoto-u.ac.jp/services/glida/. We hope that it will prove very useful for Chemical Genomics research and GPCR-related drug discovery. [Abstract/Link to Full Text]

Levy A, Sela N, Ast G
TranspoGene and microTranspoGene: transposed elements influence on the transcriptome of seven vertebrates and invertebrates.
Nucleic Acids Res. 2007 Nov 5;
Transposed elements (TEs) are mobile genetic sequences. During the evolution of eukaryotes TEs were inserted into active protein-coding genes, affecting gene structure, expression and splicing patterns, and protein sequences. Genomic insertions of TEs also led to creation and expression of new functional non-coding RNAs such as microRNAs. We have constructed the TranspoGene database, which covers TEs located inside protein-coding genes of seven species: human, mouse, chicken, zebrafish, fruit fly, nematode and sea squirt. TEs were classified according to location within the gene: proximal promoter TEs, exonized TEs (insertion within an intron that led to exon creation), exonic TEs (insertion into an existing exon) or intronic TEs. TranspoGene contains information regarding specific type and family of the TEs, genomic and mRNA location, sequence, supporting transcript accession and alignment to the TE consensus sequence. The database also contains host gene specific data: gene name, genomic location, Swiss-Prot and RefSeq accessions, diseases associated with the gene and splicing pattern. In addition, we created microTranspoGene: a database of human, mouse, zebrafish and nematode TE-derived microRNAs. The TranspoGene and microTranspoGene databases can be used by researchers interested in the effect of TE insertion on the eukaryotic transcriptome. Publicly available query interfaces to TranspoGene and microTranspoGene are available at http://transpogene.tau.ac.il/ and http://microtranspogene.tau.ac.il, respectively. The entire database can be downloaded as flat files. [Abstract/Link to Full Text]

Sprenger J, Fink JL, Karunaratne S, Hanson K, Hamilton NA, Teasdale RD
LOCATE: a mammalian protein subcellular localization database.
Nucleic Acids Res. 2007 Nov 5;
LOCATE is a curated, web-accessible database that houses data describing the membrane organization and subcellular localization of mouse and human proteins. Over the past 2 years, the data in LOCATE have grown substantially. The database now contains high-quality localization data for 20% of the mouse proteome and general localization annotation for nearly 36% of the mouse proteome. The proteome annotated in LOCATE is from the RIKEN FANTOM Consortium Isoform Protein Sequence sets which contains 58 128 mouse and 64 637 human protein isoforms. Other additions include computational subcellular localization predictions, automated computational classification of experimental localization image data, prediction of protein sorting signals and third party submission of literature data. Collectively, this database provides localization proteome for individual subcellular compartments that will underpin future systematic investigations of these regions. It is available at http://locate.imb.uq.edu.au/ [Abstract/Link to Full Text]

Wang CK, Kaas Q, Chiche L, Craik DJ
CyBase: a database of cyclic protein sequences and structures, with applications in protein discovery and engineering.
Nucleic Acids Res. 2007 Nov 5;
CyBase was originally developed as a database for backbone-cyclized proteins, providing search and display capabilities for sequence, structure and function data. Cyclic proteins are interesting because, compared to conventional proteins, they have increased stability and enhanced binding affinity and therefore can potentially be developed as protein drugs. The new CyBase release features a redesigned interface and internal architecture to improve user-interactivity, collates double the amount of data compared to the initial release, and hosts a novel suite of tools that are useful for the visualization, characterization and engineering of cyclic proteins. These tools comprise sequence/structure 2D representations, a summary of grafting and mutation studies of synthetic analogues, a study of N- to C-terminal distances in known protein structures and a structural modelling tool to predict the best linker length to cyclize a protein. These updates are useful because they have the potential to help accelerate the discovery of naturally occurring cyclic proteins and the engineering of cyclic protein drugs. The new release of CyBase is available at http://research1t.imb.uq.edu.au/cybase. [Abstract/Link to Full Text]

Swarbreck D, Wilks C, Lamesch P, Berardini TZ, Garcia-Hernandez M, Foerster H, Li D, Meyer T, Muller R, Ploetz L, Radenbaugh A, Singh S, Swing V, Tissier C, Zhang P, Huala E
The Arabidopsis Information Resource (TAIR): gene structure and function annotation.
Nucleic Acids Res. 2007 Nov 5;
The Arabidopsis Information Resource (TAIR, http://arabidopsis.org) is the model organism database for the fully sequenced and intensively studied model plant Arabidopsis thaliana. Data in TAIR is derived in large part from manual curation of the Arabidopsis research literature and direct submissions from the research community. New developments at TAIR include the addition of the GBrowse genome viewer to the TAIR site, a redesigned home page, navigation structure and portal pages to make the site more intuitive and easier to use, the launch of several TAIR web services and a new genome annotation release (TAIR7) in April 2007. A combination of manual and computational methods were used to generate this release, which contains 27 029 protein-coding genes, 3889 pseudogenes or transposable elements and 1123 ncRNAs (32 041 genes in all, 37 019 gene models). A total of 681 new genes and 1002 new splice variants were added. Overall, 10 098 loci (one-third of all loci from the previous TAIR6 release) were updated for the TAIR7 release. [Abstract/Link to Full Text]

Laun P, Bruschi CV, Richard Dickinson J, Rinnerthaler M, Heeren G, Schwimbersky R, Rid R, Breitenbach M
Yeast mother cell-specific ageing, genetic (in)stability, and the somatic mutation theory of ageing.
Nucleic Acids Res. 2007 Dec 11;
Yeast mother cell-specific ageing is characterized by a limited capacity to produce daughter cells. The replicative lifespan is determined by the number of cell cycles a mother cell has undergone, not by calendar time, and in a population of cells its distribution follows the Gompertz law. Daughter cells reset their clock to zero and enjoy the full lifespan characteristic for the strain. This kind of replicative ageing of a cell population based on asymmetric cell divisions is investigated as a model for the ageing of a stem cell population in higher organisms. The simple fact that the daughter cells can reset their clock to zero precludes the accumulation of chromosomal mutations as the cause of ageing, because semiconservative replication would lead to the same mutations in the daughters. However, nature is more complicated than that because, (i) the very last daughters of old mothers do not reset the clock; and (ii) mutations in mitochondrial DNA could play a role in ageing due to the large copy number in the cell and a possible asymmetric distribution of damaged mitochondrial DNA between mother and daughter cell. Investigation of the loss of heterozygosity in diploid cells at the end of their mother cell-specific lifespan has shown that genomic rearrangements do occur in old mother cells. However, it is not clear if this kind of genomic instability is causative for the ageing process. Damaged material other than DNA, for instance misfolded, oxidized or otherwise damaged proteins, seem to play a major role in ageing, depending on the balance between production and removal through various repair processes, for instance several kinds of proteolysis and autophagy. We are reviewing here the evidence for genetic change and its causality in the mother cell-specific ageing process of yeast. [Abstract/Link to Full Text]

Heazlewood JL, Durek P, Hummel J, Selbig J, Weckwerth W, Walther D, Schulze WX
PhosPhAt: a database of phosphorylation sites in Arabidopsis thaliana and a plant-specific phosphorylation site predictor.
Nucleic Acids Res. 2007 Nov 4;
The PhosPhAt database provides a resource consolidating our current knowledge of mass spectrometry-based identified phosphorylation sites in Arabidopsis and combines it with phosphorylation site prediction specifically trained on experimentally identified Arabidopsis phosphorylation motifs. The database currently contains 1187 unique tryptic peptide sequences encompassing 1053 Arabidopsis proteins. Among the characterized phosphorylation sites, there are over 1000 with unambiguous site assignments, and nearly 500 for which the precise phosphorylation site could not be determined. The database is searchable by protein accession number, physical peptide characteristics, as well as by experimental conditions (tissue sampled, phosphopeptide enrichment method). For each protein, a phosphorylation site overview is presented in tabular form with detailed information on each identified phosphopeptide. We have utilized a set of 802 experimentally validated serine phosphorylation sites to develop a method for prediction of serine phosphorylation (pSer) in Arabidopsis. An analysis of the current annotated Arabidopsis proteome yielded in 27 782 predicted phosphoserine sites distributed across 17 035 proteins. These prediction results are summarized graphically in the database together with the experimental phosphorylation sites in a whole sequence context. The Arabidopsis Protein Phosphorylation Site Database (PhosPhAt) provides a valuable resource to the plant science community and can be accessed through the following link http://phosphat.mpimp-golm.mpg.de. [Abstract/Link to Full Text]

Bowes JB, Snyder KA, Segerdell E, Gibb R, Jarabek C, Noumen E, Pollet N, Vize PD
Xenbase: a Xenopus biology and genomics resource.
Nucleic Acids Res. 2007 Nov 4;
Xenbase (www.xenbase.org) is a model organism database integrating a diverse array of biological and genomic data on the frogs, Xenopus laevis and Xenopus (Silurana) tropicalis. Data is collected from other databases, high-throughput screens and the scientific literature and integrated into a number of database modules covering subjects such as community, literature, gene and genomic analysis. Gene pages are automatically assembled from data piped from the Entrez Gene, Gurdon Institute, JGI, Metazome, MGI, OMIM, PubMed, Unigene, Zfin, commercial suppliers and others. These data are then supplemented with in-house annotation. Xenbase has implemented the Gbrowse genome browser and also provides a BLAST service that allows users to specifically search either laevis or tropicalis DNA or protein targets. A table of Xenopus gene synonyms has been implemented and allows the genome, genes, publications and high-throughput gene expression data to be seamlessly integrated with other Xenopus data and to external database resources, making the wealth of developmental and functional data from the frog available to the broader research community. [Abstract/Link to Full Text]

Bruford EA, Lush MJ, Wright MW, Sneddon TP, Povey S, Birney E
The HGNC Database in 2008: a resource for the human genome.
Nucleic Acids Res. 2007 Nov 4;
The HUGO Gene Nomenclature Committee (HGNC) aims to assign a unique and ideally meaningful name and symbol to every human gene. The HGNC database currently comprises over 24 000 public records containing approved human gene nomenclature and associated gene information. Following our recent relocation to the European Bioinformatics Institute our homepage can now be found at http://www.genenames.org, with direct links to the searchable HGNC database and other related database resources, such as the HCOP orthology search tool and manually curated gene family webpages. [Abstract/Link to Full Text]

The Gene Ontology project in 2008.
Nucleic Acids Res. 2007 Nov 4;
The Gene Ontology (GO) project (http://www.geneontology.org/) provides a set of structured, controlled vocabularies for community use in annotating genes, gene products and sequences (also see http://www.sequenceontology.org/). The ontologies have been extended and refined for several biological areas, and improvements to the structure of the ontologies have been implemented. To improve the quantity and quality of gene product annotations available from its public repository, the GO Consortium has launched a focused effort to provide comprehensive and detailed annotation of orthologous genes across a number of 'reference' genomes, including human and several key model organisms. Software developments include two releases of the ontology-editing tool OBO-Edit, and improvements to the AmiGO browser interface. [Abstract/Link to Full Text]

Zhang C, Crasta O, Cammer S, Will R, Kenyon R, Sullivan D, Yu Q, Sun W, Jha R, Liu D, Xue T, Zhang Y, Moore M, McGarvey P, Huang H, Chen Y, Zhang J, Mazumder R, Wu C, Sobral B
An emerging cyberinfrastructure for biodefense pathogen and pathogen host data.
Nucleic Acids Res. 2007 Nov 4;
The NIAID-funded Biodefense Proteomics Resource Center (RC) provides storage, dissemination, visualization and analysis capabilities for the experimental data deposited by seven Proteomics Research Centers (PRCs). The data and its publication is to support researchers working to discover candidates for the next generation of vaccines, therapeutics and diagnostics against NIAID's Category A, B and C priority pathogens. The data includes transcriptional profiles, protein profiles, protein structural data and host-pathogen protein interactions, in the context of the pathogen life cycle in vivo and in vitro. The database has stored and supported host or pathogen data derived from Bacillus, Brucella, Cryptosporidium, Salmonella, SARS, Toxoplasma, Vibrio and Yersinia, human tissue libraries, and mouse macrophages. These publicly available data cover diverse data types such as mass spectrometry, yeast two-hybrid (Y2H), gene expression profiles, X-ray and NMR determined protein structures and protein expression clones. The growing database covers over 23 000 unique genes/proteins from different experiments and organisms. All of the genes/proteins are annotated and integrated across experiments using UniProt Knowledgebase (UniProtKB) accession numbers. The web-interface for the database enables searching, querying and downloading at the level of experiment, group and individual gene(s)/protein(s) via UniProtKB accession numbers or protein function keywords. The system is accessible at http://www.proteomicsresource.org/. [Abstract/Link to Full Text]

Luke B, Azzalin CM, Hug N, Deplazes A, Peter M, Lingner J
Saccharomyces cerevisiae Ebs1p is a putative ortholog of human Smg7 and promotes nonsense-mediated mRNA decay.
Nucleic Acids Res. 2007 Nov 4;
The Smg proteins Smg5, Smg6 and Smg7 are involved in nonsense-mediated RNA decay (NMD) in metazoans, but no orthologs have been found in the budding yeast Saccharomyces cerevisiae. Sequence alignments reveal that yeast Ebs1p is similar in structure to the human Smg5-7, with highest homology to Smg7. We demonstrate here that Ebs1p is involved in NMD and behaves similarly to human Smg proteins. Indeed, both loss and overexpression of Ebs1p results in stabilization of NMD targets. However, Ebs1-loss in yeast or Smg7-depletion in human cells only partially disrupts NMD and in the latter, Smg7-depletion is partially compensated for by Smg6. Ebs1p physically interacts with the NMD helicase Upf1p and overexpressed Ebs1p leads to recruitment of Upf1p into cytoplasmic P-bodies. Furthermore, Ebs1p localizes to P-bodies upon glucose starvation along with Upf1p. Overall our findings suggest that NMD is more conserved in evolution than previously thought, and that at least one of the Smg5-7 proteins is conserved in budding yeast. [Abstract/Link to Full Text]

Yang J, Chen L, Sun L, Yu J, Jin Q
VFDB 2008 release: an enhanced web-based resource for comparative pathogenomics.
Nucleic Acids Res. 2007 Nov 4;
Virulence factor database (VFDB) was set up in 2004 dedicated for providing current knowledge of virulence factors (VFs) from various medical significant bacterial pathogens to facilitate pathogenomic research. Nowadays, complete genome sequences of almost all the major pathogenic microbes have been determined, which makes comparative genomics a powerful approach for uncovering novel virulence determinants and hidden aspects of pathogenesis. VFDB was therefore upgraded to present the enormous diversity of bacterial genomes in terms of virulence genes and their organization. The VFDB 2008 release includes the following new features; (i) detailed tabular comparison of virulence composition of a given genome with other genomes of the same genus, (ii) multiple alignments and statistical analysis of homologous VFs and (iii) graphical comparison of genomic organizations of virulence genes. Comparative analysis of the numerous VFs will improve our understanding of the nature and evolution of virulence, as well as the development of new therapeutic and preventive strategies. VFDB 2008 release offers more user-friendly tools for comparative pathogenomics and it is publicly accessible at http://www.mgc.ac.cn/VFs/. [Abstract/Link to Full Text]

Ulrich EL, Akutsu H, Doreleijers JF, Harano Y, Ioannidis YE, Lin J, Livny M, Mading S, Maziuk D, Miller Z, Nakatani E, Schulte CF, Tolmie DE, Kent Wenger R, Yao H, Markley JL
BioMagResBank.
Nucleic Acids Res. 2007 Nov 4;
The BioMagResBank (BMRB: www.bmrb.wisc.edu) is a repository for experimental and derived data gathered from nuclear magnetic resonance (NMR) spectroscopic studies of biological molecules. BMRB is a partner in the Worldwide Protein Data Bank (wwPDB). The BMRB archive consists of four main data depositories: (i) quantitative NMR spectral parameters for proteins, peptides, nucleic acids, carbohydrates and ligands or cofactors (assigned chemical shifts, coupling constants and peak lists) and derived data (relaxation parameters, residual dipolar couplings, hydrogen exchange rates, pK(a) values, etc.), (ii) databases for NMR restraints processed from original author depositions available from the Protein Data Bank, (iii) time-domain (raw) spectral data from NMR experiments used to assign spectral resonances and determine the structures of biological macromolecules and (iv) a database of one- and two-dimensional (1)H and (13)C one- and two-dimensional NMR spectra for over 250 metabolites. The BMRB website provides free access to all of these data. BMRB has tools for querying the archive and retrieving information and an ftp site (ftp.bmrb.wisc.edu) where data in the archive can be downloaded in bulk. Two BMRB mirror sites exist: one at the PDBj, Protein Research Institute, Osaka University, Osaka, Japan (bmrb.protein.osaka-u.ac.jp) and the other at CERM, University of Florence, Florence, Italy (bmrb.postgenomicnmr.net/). The site at Osaka also accepts and processes data depositions. [Abstract/Link to Full Text]

Halees AS, El-Badrawi R, Khabar KS
ARED Organism: expansion of ARED reveals AU-rich element cluster variations between human and mouse.
Nucleic Acids Res. 2007 Nov 4;
ARED Organism represents the expansion of the adenylate uridylate (AU)-rich element (ARE)-containing human mRNA database into the transcriptomes of mouse and rat. As a result, we performed quantitative assessment of ARE conservation in human, mouse and rat transcripts. We found that a significant proportion ( approximately 25%) of human genes differ in their ARE patterns from mouse and rat transcripts. ARED-Integrated, another updated and expanded version of ARED, is a compilation of ARED versions 1.0 to 3.0 and updated version 4.0 that is devoted to human mRNAs. Thus, ARED-Integrated and ARED-Organism databases, both publicly available at http://brp.kfshrc.edu.sa/ARED, offer scientists a comprehensive view of AREs in the human transcriptome and the ability to study the comparative genomics of AREs in model organisms. This ultimately will help in inferring the biological consequences of ARE variation in these key animal models as opposed to humans, particularly, in relationships to the role of RNA stability in disease. [Abstract/Link to Full Text]

Liang C, Jaiswal P, Hebbard C, Avraham S, Buckler ES, Casstevens T, Hurwitz B, McCouch S, Ni J, Pujar A, Ravenscroft D, Ren L, Spooner W, Tecle I, Thomason J, Tung CW, Wei X, Yap I, Youens-Clark K, Ware D, Stein L
Gramene: a growing plant comparative genomics resource.
Nucleic Acids Res. 2007 Nov 4;
Gramene (www.gramene.org) is a curated resource for genetic, genomic and comparative genomics data for the major crop species, including rice, maize, wheat and many other plant (mainly grass) species. Gramene is an open-source project. All data and software are freely downloadable through the ftp site (ftp.gramene.org/pub/gramene) and available for use without restriction. Gramene's core data types include genome assembly and annotations, other DNA/mRNA sequences, genetic and physical maps/markers, genes, quantitative trait loci (QTLs), proteins, ontologies, literature and comparative mappings. Since our last NAR publication 2 years ago, we have updated these data types to include new datasets and new connections among them. Completely new features include rice pathways for functional annotation of rice genes; genetic diversity data from rice, maize and wheat to show genetic variations among different germplasms; large-scale genome comparisons among Oryza sativa and its wild relatives for evolutionary studies; and the creation of orthologous gene sets and phylogenetic trees among rice, Arabidopsis thaliana, maize, poplar and several animal species (for reference purpose). We have significantly improved the web interface in order to provide a more user-friendly browsing experience, including a dropdown navigation menu system, unified web page for markers, genes, QTLs and proteins, and enhanced quick search functions. [Abstract/Link to Full Text]

Quyen DV, Ha SC, Lowenhaupt K, Rich A, Kim KK, Kim YG
Characterization of DNA-binding activity of Z{alpha} domains from poxviruses and the importance of the {beta}-wing regions in converting B-DNA to Z-DNA.
Nucleic Acids Res. 2007 Nov 5;
The E3L gene is essential for pathogenesis in vaccinia virus. The E3L gene product consists of an N-terminal Zalpha domain and a C-terminal double-stranded RNA (dsRNA) binding domain; the left-handed Z-DNA-binding activity of the Zalpha domain of E3L is required for viral pathogenicity in mice. E3L is highly conserved among poxviruses, including the smallpox virus, and it is likely that the orthologous Zalpha domains play similar roles. To better understand the biological function of E3L proteins, we have investigated the Z-DNA-binding behavior of five representative Zalpha domains from poxviruses. Using surface plasmon resonance (SPR), we have demonstrated that these viral Zalpha domains bind Z-DNA tightly. Ability of Zalpha(E3L) converting B-DNA to Z-DNA was measured by circular dichroism (CD). The extents to which these Zalphas can stabilize Z-DNA vary considerably. Mutational studies demonstrate that residues in the loop of the beta-wing play an important role in this stabilization. Notably the Zalpha domain of vaccinia E3L acquires ability to convert B-DNA to Z-DNA by mutating amino acid residues in this region. Differences in the host cells of the various poxviruses may require different abilities to stabilize Z-DNA; this may be reflected in the observed differences in behavior in these Zalpha proteins. [Abstract/Link to Full Text]

Harris SA, Laughton CA, Liverpool TB
Mapping the phase diagram of the writhe of DNA nanocircles using atomistic molecular dynamics simulations.
Nucleic Acids Res. 2007 Nov 5;
We have investigated the effects of duplex length, sequence, salt concentration and superhelical density on the conformation of DNA nanocircles containing up to 178 base pairs using atomistic molecular dynamics simulation. These calculations reveal that the partitioning of twist and writhe is governed by a delicate balance of competing energetic terms. We have identified conditions which favour circular, positively or negatively writhed and denatured DNA conformations. Our simulations show that AT-rich DNA is more prone to denaturation when subjected to torsional stress than the corresponding GC containing circles. In contrast to the behaviour expected for a simple elastic rod, there is a distinct asymmetry in the behaviour of over and under-wound DNA nanocircles. The most biologically relevant negatively writhed state is more elusive than the corresponding positively writhed conformation, and is only observed for larger circles under conditions of high electrostatic screening. The simulation results have been summarised by plotting a phase diagram describing the various conformational states of nanocircles over the range of circle sizes and experimental conditions explored during the study. The changes in DNA structure that accompany supercoiling suggest a number of mechanisms whereby changes in DNA topology in vivo might be used to influence gene expression. [Abstract/Link to Full Text]

Gendron K, Charbonneau J, Dulude D, Heveker N, Ferbeyre G, Brakier-Gingras L
The presence of the TAR RNA structure alters the programmed -1 ribosomal frameshift efficiency of the human immunodeficiency virus type 1 (HIV-1) by modifying the rate of translation initiation.
Nucleic Acids Res. 2007 Nov 5;
HIV-1 uses a programmed -1 ribosomal frameshift to synthesize the precursor of its enzymes, Gag-Pol. The frameshift efficiency that is critical for the virus replication, is controlled by an interaction between the ribosome and a specific structure on the viral mRNA, the frameshift stimulatory signal. The rate of cap-dependent translation initiation is known to be altered by the TAR RNA structure, present at the 5' and 3' end of all HIV-1 mRNAs. Depending upon its concentration, TAR activates or inhibits the double-stranded RNA-dependent protein kinase (PKR). We investigated here whether changes in translation initiation caused by TAR affect HIV-1 frameshift efficiency. CD4+ T cells and 293T cells were transfected with a dual-luciferase construct where the firefly luciferase expression depends upon the HIV-1 frameshift. Translation initiation was altered by adding TAR in cis or trans of the reporter mRNA. We show that HIV-1 frameshift efficiency correlates negatively with changes in the rate of translation initiation caused by TAR and mediated by PKR. A model is presented where changes in the rate of initiation affect the probability of frameshifting by altering the distance between elongating ribosomes on the mRNA, which influences the frequency of encounter between these ribosomes and the frameshift stimulatory signal. [Abstract/Link to Full Text]

Ding G, Sun Y, Li H, Wang Z, Fan H, Wang C, Yang D, Li Y
EPGD: a comprehensive web resource for integrating and displaying eukaryotic paralog/paralogon information.
Nucleic Acids Res. 2007 Nov 5;
Gene duplication is common in all three domains of life, especially in eukaryotic genomes. The duplicates provide new material for the action of evolutionary forces such as selection or genetic drift. Here we describe a sophisticated procedure to extract duplicated genes (paralogs) from 26 available eukaryotic genomes, to pre-calculate several evolutionary indexes (evolutionary rate, synonymous distance/clock, transition redundant exchange clock, etc.) based on the paralog family, and to identify block or segmental duplications (paralogons). We also constructed an internet-accessible Eukaryotic Paralog Group Database (EPGD; http://epgd.biosino.org/EPGD/). The database is gene-centered and organized by paralog family. It focuses on paralogs and evolutionary duplication events. The paralog families and paralogons can be searched by text or sequence, and are downloadable from the website as plain text files. The database will be very useful for both experimentalists and bioinformaticians interested in the study of duplication events or paralog families. [Abstract/Link to Full Text]

Chaudhuri RR, Loman NJ, Snyder LA, Bailey CM, Stekel DJ, Pallen MJ
xBASE2: a comprehensive resource for comparative bacterial genomics.
Nucleic Acids Res. 2007 Nov 5;
xBASE is a genome database aimed at helping laboratory-based bacteriologists make best use of bacterial genome sequence data, with a particular emphasis on comparative genomics. The latest version, xBASE 2.0 (http://xbase.bham.ac.uk), now provides comprehensive coverage of all bacterial genomes and features an updated modularized backend and an improved user interface, which includes a taxonomy browser and a powerful full-text search facility. [Abstract/Link to Full Text]

Chen PH, Tsao YP, Wang CC, Chen SL
Nuclear receptor interaction protein, a coactivator of androgen receptors (AR), is regulated by AR and Sp1 to feed forward and activate its own gene expression through AR protein stability.
Nucleic Acids Res. 2007 Nov 5;
Previously, we found a novel gene, nuclear receptor interaction protein (NRIP), a transcription cofactor that can enhance an AR-driven PSA promoter activity in a ligand-dependent manner in prostate cancer cells. Here, we investigated NRIP regulation. We cloned a 413-bp fragment from the transcription initiation site of the NRIP gene that had strong promoter activity, was TATA-less and GC-rich, and, based on DNA sequences, contained one androgen response element (ARE) and three Sp1-binding sites (Sp1-1, Sp1-2, Sp1-3). Transient promoter luciferase assays, chromatin immunoprecipitation and small RNA interference analyses mapped ARE and Sp1-2-binding sites involved in NRIP promoter activation, implying that NRIP is a target gene for AR or Sp1. AR associates with the NRIP promoter through ARE and indirectly through Sp1-binding site via AR-Sp1 complex formation. Thus both ARE and Sp1-binding site within the NRIP promoter can respond to androgen induction. More intriguingly, NRIP plays a feed-forward role enhancing AR-driven NRIP promoter activity via NRIP forming a complex with AR to protect AR protein from proteasome degradation. This is the first demonstration that NRIP is a novel AR-target gene and that NRIP expression feeds forward and activates its own expression through AR protein stability. [Abstract/Link to Full Text]

Masih PJ, Kunnev D, Melendy T
Mismatch Repair proteins are recruited to replicating DNA through interaction with Proliferating Cell Nuclear Antigen (PCNA).
Nucleic Acids Res. 2007 Nov 5;
Mismatch Repair (MMR) is closely linked to DNA replication; however, other than the role of the replicative sliding clamp (PCNA) in various MMR functions, the linkage between DNA replication and MMR has been difficult to investigate. Here we use an in vitro DNA replication system based on simian virus 40, to investigate MMR recruitment to replicating DNA. Both DNA replication and MMR proteins are recruited to replicating DNA in an origin-dependent fashion. Primer synthesis is required for recruitment of both PCNA and MMR proteins, but not for recruitment of the single-stranded DNA-binding protein (RPA). Blocking PCNA recruitment to replicating DNA with a p21-based polypeptide blocks PCNA and MMR, but not RPA recruitment. Once PCNA and subsequent proteins required for replication are loaded onto DNA, addition of p21 leaves PCNA on the replicating DNA, but actively displaces MMR proteins. These findings indicate that the MMR machinery is recruited to replicating DNA through its interaction with PCNA, and suggests that this occurs via binding of the MMR proteins to the multi-protein interaction sites on PCNA. These studies demonstrate the utility of this system for further investigation of the role of DNA replication in MMR. [Abstract/Link to Full Text]

Qin Y, Rezler EM, Gokhale V, Sun D, Hurley LH
Characterization of the G-quadruplexes in the duplex nuclease hypersensitive element of the PDGF-A promoter and modulation of PDGF-A promoter activity by TMPyP4.
Nucleic Acids Res. 2007 Nov 5;
The proximal 5'-flanking region of the human platelet-derived growth factor A (PDGF-A) promoter contains one nuclease hypersensitive element (NHE) that is critical for PDGF-A gene transcription. On the basis of circular dichroism (CD) and electrophoretic mobility shift assay (EMSA), we have shown that the guanine-rich (G-rich) strand of the DNA in this region can form stable intramolecular parallel G-quadruplexes under physiological conditions. A Taq polymerase stop assay has shown that the G-rich strand of the NHE can form two major G-quadruplex structures, which are in dynamic equilibrium and differentially stabilized by three G-quadruplex-interactive drugs. One major parallel G-quadruplex structure of the G-rich strand DNA of NHE was identified by CD and dimethyl sulfate (DMS) footprinting. Surprisingly, CD spectroscopy shows a stable parallel G-quadruplex structure formed within the duplex DNA of the NHE at temperatures up to 100 degrees C. This structure has been characterized by DMS footprinting in the double-stranded DNA of the NHE. In transfection experiments, 10 muM TMPyP4 reduced the activity of the basal promoter of PDGF-A approximately 40%, relative to the control. On the basis of these results, we have established that ligand-mediated stabilization of G-quadruplex structures within the PDGF-A NHE can silence PDGF-A expression. [Abstract/Link to Full Text]

Tagawa M, Shohda KI, Fujimoto K, Sugawara T, Suyama A
Heat-resistant DNA tile arrays constructed by template-directed photoligation through 5-carboxyvinyl-2'-deoxyuridine.
Nucleic Acids Res. 2007 Nov 3;
Template-directed DNA photoligation has been applied to a method to construct heat-resistant two-dimensional (2D) DNA arrays that can work as scaffolds in bottom-up assembly of functional biomolecules and nano-electronic components. DNA double-crossover AB-staggered (DXAB) tiles were covalently connected by enzyme-free template-directed photoligation, which enables a specific ligation reaction in an extremely tight space and under buffer conditions where no enzymes work efficiently. DNA nanostructures created by self-assembly of the DXAB tiles before and after photoligation have been visualized by high-resolution, tapping mode atomic force microscopy in buffer. The improvement of the heat tolerance of 2D DNA arrays was confirmed by heating and visualizing the DNA nanostructures. The heat-resistant DNA arrays may expand the potential of DNA as functional materials in biotechnology and nanotechnology. [Abstract/Link to Full Text]

Tourasse NJ, Kolst� AB
SuperCAT: a supertree database for combined and integrative multilocus sequence typing analysis of the Bacillus cereus group of bacteria (including B. cereus, B. anthracis and B. thuringiensis).
Nucleic Acids Res. 2007 Nov 3;
The Bacillus cereus group of bacteria is an important group including mammalian and insect pathogens, such as B. anthracis, the anthrax bacterium, B. thuringiensis, used as a biological pesticide and B. cereus, often involved in food poisoning incidents. To characterize the population structure and epidemiology of these bacteria, five separate multilocus sequence typing (MLST) schemes have been developed, which makes results difficult to compare. Therefore, we have developed a database that compiles and integrates MLST data from all five schemes for the B. cereus group, accessible at http://mlstoslo.uio.no/. Supertree techniques were used to combine the phylogenetic information from analysis of all schemes and datasets, in order to produce an integrated view of the B. cereus group population. The database currently contains strain information and sequence data for 1029 isolates and 26 housekeeping gene fragments, which can be searched by keywords, MLST scheme, or sequence similarity. Supertrees can be browsed according to various criteria such as species, isolate source, or genetic distance, and subtrees containing strains of interest can be extracted. Besides analysis of the available data, the user has the possibility to enter her/his own sequences and compare them to the database and/or include them into the supertree reconstructions. [Abstract/Link to Full Text]

Matsuya A, Sakate R, Kawahara Y, Koyanagi KO, Sato Y, Fujii Y, Yamasaki C, Habara T, Nakaoka H, Todokoro F, Yamaguchi K, Endo T, Oota S, Makalowski W, Ikeo K, Suzuki Y, Hanada K, Hashimoto K, Hirai M, Iwama H, Saitou N, Hiraki AT, Jin L, Kaneko Y, Kanno M, Murakami K, Noda AO, Saichi N, Sanbonmatsu R, Suzuki M, Takeda JI, Tanaka M, Gojobori T, Imanishi T, Itoh T
Evola: Ortholog database of all human genes in H-InvDB with manual curation of phylogenetic trees.
Nucleic Acids Res. 2007 Nov 3;
Orthologs are genes in different species that evolved from a common ancestral gene by speciation. Currently, with the rapid growth of transcriptome data of various species, more reliable orthology information is prerequisite for further studies. However, detection of orthologs could be erroneous if pairwise distance-based methods, such as reciprocal BLAST searches, are utilized. Thus, as a sub-database of H-InvDB, an integrated database of annotated human genes (http://h-invitational.jp/), we constructed a fully curated database of evolutionary features of human genes, called 'Evola'. In the process of the ortholog detection, computational analysis based on conserved genome synteny and transcript sequence similarity was followed by manual curation by researchers examining phylogenetic trees. In total, 18 968 human genes have orthologs among 11 vertebrates (chimpanzee, mouse, cow, chicken, zebrafish, etc.), either computationally detected or manually curated orthologs. Evola provides amino acid sequence alignments and phylogenetic trees of orthologs and homologs. In 'd(N)/d(S) view', natural selection on genes can be analyzed between human and other species. In 'Locus maps', all transcript variants and their exon/intron structures can be compared among orthologous gene loci. We expect the Evola to serve as a comprehensive and reliable database to be utilized in comparative analyses for obtaining new knowledge about human genes. Evola is available at http://www.h-invitational.jp/evola/. [Abstract/Link to Full Text]

Hong EL, Balakrishnan R, Dong Q, Christie KR, Park J, Binkley G, Costanzo MC, Dwight SS, Engel SR, Fisk DG, Hirschman JE, Hitz BC, Krieger CJ, Livstone MS, Miyasato SR, Nash RS, Oughtred R, Skrzypek MS, Weng S, Wong ED, Zhu KK, Dolinski K, Botstein D, Cherry JM
Gene Ontology annotations at SGD: new data sources and annotation methods.
Nucleic Acids Res. 2007 Nov 3;
The Saccharomyces Genome Database (SGD; http://www.yeastgenome.org/) collects and organizes biological information about the chromosomal features and gene products of the budding yeast Saccharomyces cerevisiae. Although published data from traditional experimental methods are the primary sources of evidence supporting Gene Ontology (GO) annotations for a gene product, high-throughput experiments and computational predictions can also provide valuable insights in the absence of an extensive body of literature. Therefore, GO annotations available at SGD now include high-throughput data as well as computational predictions provided by the GO Annotation Project (GOA UniProt; http://www.ebi.ac.uk/GOA/). Because the annotation method used to assign GO annotations varies by data source, GO resources at SGD have been modified to distinguish data sources and annotation methods. In addition to providing information for genes that have not been experimentally characterized, GO annotations from independent sources can be compared to those made by SGD to help keep the literature-based GO annotations current. [Abstract/Link to Full Text]

Recent Articles in Genome Research

Ichiyanagi K, Nakajima R, Kajikawa M, Okada N
Novel retrotransposon analysis reveals multiple mobility pathways dictated by hosts.
Genome Res. 2007 Jan;17(1):33-41.
Autonomous non-long-terminal-repeat retrotransposons (NLRs) proliferate by retrotransposition via coordinated reactions of target DNA cleavage and reverse transcription by a mechanism called target-primed reverse transcription (TPRT). Whereas this mechanism guarantees the covalent attachment of the NLR and its target site at the 3' junction, mechanisms for the joining at the 5' junction have been conjectural. To better understand the retrotransposition pathways, we analyzed target-NLR junctions of zebrafish NLRs with a new method of identifying genomic copies that reside within other transposons, termed "target analysis of nested transposons" (TANT). Application of the TANT method revealed various features of the zebrafish NLR integrants; for example, half of the integrants carry extra nucleotides at the 5' junction, which is in stark contrast to the major human NLR, LINE-1. Interestingly, in a cell culture assay, retrotransposition of the zebrafish NLR in heterologous human cells did not bear extra 5' nucleotides, indicating that the choice of the 5' joining pathway is affected by the host. Our results suggest that several pathways exist for NLR retrotransposition and argue in favor of host protein involvement. With genomic sequence information accumulating exponentially, our data demonstrate the general applicability of the TANT method for the analysis of a wide variety of retrotransposons. [Abstract/Link to Full Text]

Paschou P, Mahoney MW, Javed A, Kidd JR, Pakstis AJ, Gu S, Kidd KK, Drineas P
Intra- and interpopulation genotype reconstruction from tagging SNPs.
Genome Res. 2007 Jan;17(1):96-107.
The optimal method to be used for tSNP selection, the applicability of a reference LD map to unassayed populations, and the scalability of these methods to genome-wide analysis, all remain subjects of debate. We propose novel, scalable matrix algorithms that address these issues and we evaluate them on genotypic data from 38 populations and four genomic regions (248 SNPs typed for approximately 2000 individuals). We also evaluate these algorithms on a second data set consisting of genotypes available from the HapMap database (1336 SNPs for four populations) over the same genomic regions. Furthermore, we test these methods in the setting of a real association study using a publicly available family data set. The algorithms we use for tSNP selection and unassayed SNP reconstruction do not require haplotype inference and they are, in principle, scalable even to genome-wide analysis. Moreover, they are greedy variants of recently developed matrix algorithms with provable performance guarantees. Using a small set of carefully selected tSNPs, we achieve very good reconstruction accuracy of "untyped" genotypes for most of the populations studied. Additionally, we demonstrate in a quantitative manner that the chosen tSNPs exhibit substantial transferability, both within and across different geographic regions. Finally, we show that reconstruction can be applied to retrieve significant SNP associations with disease, with important genotyping savings. [Abstract/Link to Full Text]

Forster AC, Church GM
Synthetic biology projects in vitro.
Genome Res. 2007 Jan;17(1):1-6.
Advances in the in vitro synthesis and evolution of DNA, RNA, and polypeptides are accelerating the construction of biopolymers, pathways, and organisms with novel functions. Known functions are being integrated and debugged with the aim of synthesizing life-like systems. The goals are knowledge, tools, smart materials, and therapies. [Abstract/Link to Full Text]

Normand P, Lapierre P, Tisa LS, Gogarten JP, Alloisio N, Bagnarol E, Bassi CA, Berry AM, Bickhart DM, Choisne N, Couloux A, Cournoyer B, Cruveiller S, Daubin V, Demange N, Francino MP, Goltsman E, Huang Y, Kopp OR, Labarre L, Lapidus A, Lavire C, Marechal J, Martinez M, Mastronunzio JE, Mullin BC, Niemann J, Pujic P, Rawnsley T, Rouy Z, Schenowitz C, Sellstedt A, Tavares F, Tomkins JP, Vallenet D, Valverde C, Wall LG, Wang Y, Medigue C, Benson DR
Genome characteristics of facultatively symbiotic Frankia sp. strains reflect host range and host plant biogeography.
Genome Res. 2007 Jan;17(1):7-15.
Soil bacteria that also form mutualistic symbioses in plants encounter two major levels of selection. One occurs during adaptation to and survival in soil, and the other occurs in concert with host plant speciation and adaptation. Actinobacteria from the genus Frankia are facultative symbionts that form N(2)-fixing root nodules on diverse and globally distributed angiosperms in the "actinorhizal" symbioses. Three closely related clades of Frankia sp. strains are recognized; members of each clade infect a subset of plants from among eight angiosperm families. We sequenced the genomes from three strains; their sizes varied from 5.43 Mbp for a narrow host range strain (Frankia sp. strain HFPCcI3) to 7.50 Mbp for a medium host range strain (Frankia alni strain ACN14a) to 9.04 Mbp for a broad host range strain (Frankia sp. strain EAN1pec.) This size divergence is the largest yet reported for such closely related soil bacteria (97.8%-98.9% identity of 16S rRNA genes). The extent of gene deletion, duplication, and acquisition is in concert with the biogeographic history of the symbioses and host plant speciation. Host plant isolation favored genome contraction, whereas host plant diversification favored genome expansion. The results support the idea that major genome expansions as well as reductions can occur in facultative symbiotic soil bacteria as they respond to new environments in the context of their symbioses. [Abstract/Link to Full Text]

Freyhult EK, Bollback JP, Gardner PP
Exploring genomic dark matter: a critical assessment of the performance of homology search methods on noncoding RNA.
Genome Res. 2007 Jan;17(1):117-25.
Homology search is one of the most ubiquitous bioinformatic tasks, yet it is unknown how effective the currently available tools are for identifying noncoding RNAs (ncRNAs). In this work, we use reliable ncRNA data sets to assess the effectiveness of methods such as BLAST, FASTA, HMMer, and Infernal. Surprisingly, the most popular homology search methods are often the least accurate. As a result, many studies have used inappropriate tools for their analyses. On the basis of our results, we suggest homology search strategies using the currently available tools and some directions for future development. [Abstract/Link to Full Text]

Hicks J, Krasnitz A, Lakshmi B, Navin NE, Riggs M, Leibu E, Esposito D, Alexander J, Troge J, Grubor V, Yoon S, Wigler M, Ye K, B�rresen-Dale AL, Naume B, Schlicting E, Norton L, H�gerstr�m T, Skoog L, Auer G, M�n�r S, Lundin P, Zetterberg A
Novel patterns of genome rearrangement and their association with survival in breast cancer.
Genome Res. 2006 Dec;16(12):1465-79.
Representational Oligonucleotide Microarray Analysis (ROMA) detects genomic amplifications and deletions with boundaries defined at a resolution of approximately 50 kb. We have used this technique to examine 243 breast tumors from two separate studies for which detailed clinical data were available. The very high resolution of this technology has enabled us to identify three characteristic patterns of genomic copy number variation in diploid tumors and to measure correlations with patient survival. One of these patterns is characterized by multiple closely spaced amplicons, or "firestorms," limited to single chromosome arms. These multiple amplifications are highly correlated with aggressive disease and poor survival even when the rest of the genome is relatively quiet. Analysis of a selected subset of clinical material suggests that a simple genomic calculation, based on the number and proximity of genomic alterations, correlates with life-table estimates of the probability of overall survival in patients with primary breast cancer. Based on this sample, we generate the working hypothesis that copy number profiling might provide information useful in making clinical decisions, especially regarding the use or not of systemic therapies (hormonal therapy, chemotherapy), in the management of operable primary breast cancer with ostensibly good prognosis, for example, small, node-negative, hormone-receptor-positive diploid cases. [Abstract/Link to Full Text]

Khattra J, Delaney AD, Zhao Y, Siddiqui A, Asano J, McDonald H, Pandoh P, Dhalla N, Prabhu AL, Ma K, Lee S, Ally A, Tam A, Sa D, Rogers S, Charest D, Stott J, Zuyderduyn S, Varhol R, Eaves C, Jones S, Holt R, Hirst M, Hoodless PA, Marra MA
Large-scale production of SAGE libraries from microdissected tissues, flow-sorted cells, and cell lines.
Genome Res. 2007 Jan;17(1):108-16.
We describe the details of a serial analysis of gene expression (SAGE) library construction and analysis platform that has enabled the generation of >298 high-quality SAGE libraries and >30 million SAGE tags primarily from sub-microgram amounts of total RNA purified from samples acquired by microdissection. Several RNA isolation methods were used to handle the diversity of samples processed, and various measures were applied to minimize ditag PCR carryover contamination. Modifications in the SAGE protocol resulted in improved cloning and DNA sequencing efficiencies. Bioinformatic measures to automatically assess DNA sequencing results were implemented to analyze the integrity of ditag structure, linker or cross-species ditag contamination, and yield of high-quality tags per sequence read. Our analysis of singleton tag errors resulted in a method for correcting such errors to statistically determine tag accuracy. From the libraries generated, we produced an essentially complete mapping of reliable 21-base-pair tags to the mouse reference genome sequence for a meta-library of approximately 5 million tags. Our analyses led us to reject the commonly held notion that duplicate ditags are artifacts. Rather than the usual practice of discarding such tags, we conclude that they should be retained to avoid introducing bias into the results and thereby maintain the quantitative nature of the data, which is a major theoretical advantage of SAGE as a tool for global transcriptional profiling. [Abstract/Link to Full Text]

Forton JT, Udalova IA, Campino S, Rockett KA, Hull J, Kwiatkowski DP
Localization of a long-range cis-regulatory element of IL13 by allelic transcript ratio mapping.
Genome Res. 2007 Jan;17(1):82-7.
It appears that, for many genes, the two alleles possessed by an individual may produce different amounts of transcript. When such allelic differences in transcription are observed for some individuals but not others, a plausible explanation is genetic variation in the cis-acting elements that regulate the gene in question. Here we describe a novel analytical approach that uses such observations, combined with genotyping data from the HapMap project, to define the genomic location of cis-acting regulatory elements. When applied to the human 5q31 chromosomal region, where complex regulatory mechanisms are known to exist, we demonstrate the sensitivity of this approach by locating a highly significant cis-regulatory element operating on IL13 at long range from a position 250 kb upstream from the gene (P = 2 x 10(-6)). As this method is unaffected by other sources of variation, such as environmental and trans-acting genetic factors, it provides a tractable approach for dissecting the complexities of genetic variation in gene regulation. [Abstract/Link to Full Text]

Roh TY, Wei G, Farrell CM, Zhao K
Genome-wide prediction of conserved and nonconserved enhancers by histone acetylation patterns.
Genome Res. 2007 Jan;17(1):74-81.
Comparative genomic studies have been useful in identifying transcriptional regulatory elements in higher eukaryotic genomes, but many important regulatory elements cannot be detected by such analyses due to evolutionary variations and alignment tool limitations. Therefore, in this study we exploit the highly conserved nature of epigenetic modifications to identify potential transcriptional enhancers. By using a high-resolution genome-wide mapping technique, which combines the chromatin immunoprecipitation and serial analysis of gene expression assays, we have recently determined the distribution of lysine 9/14-diacetylated histone H3 in human T cells. We showed the existence of 46,813 regions with clusters of histone acetylation, termed histone acetylation islands, some of which correspond to known transcriptional regulatory elements. In the present study, we find that 4679 sequences conserved between human and pufferfish coincide with histone acetylation islands, and random sampling shows that 33% (13/39) of these can function as transcriptional enhancers in human Jurkat T cells. In addition, by comparing the human histone acetylation island sequences with mouse genome sequences, we find that despite the conservation of many of these regions between these species, 21,855 of these sequences are not conserved. Furthermore, we demonstrate that about 50% (26/51) of these nonconserved sequences have enhancer activity in Jurkat cells, and that many of the orthologous mouse sequences also have enhancer activity in addition to conserved epigenetic modification patterns in mouse T-cell chromatin. Therefore, by combining epigenetic modification and sequence data, we have established a novel genome-wide method for identifying regulatory elements not discernable by comparative genomics alone. [Abstract/Link to Full Text]

Fiegler H, Redon R, Andrews D, Scott C, Andrews R, Carder C, Clark R, Dovey O, Ellis P, Feuk L, French L, Hunt P, Kalaitzopoulos D, Larkin J, Montgomery L, Perry GH, Plumb BW, Porter K, Rigby RE, Rigler D, Valsesia A, Langford C, Humphray SJ, Scherer SW, Lee C, Hurles ME, Carter NP
Accurate and reliable high-throughput detection of copy number variation in the human genome.
Genome Res. 2006 Dec;16(12):1566-74.
This study describes a new tool for accurate and reliable high-throughput detection of copy number variation in the human genome. We have constructed a large-insert clone DNA microarray covering the entire human genome in tiling path resolution that we have used to identify copy number variation in human populations. Crucial to this study has been the development of a robust array platform and analytic process for the automated identification of copy number variants (CNVs). The array consists of 26,574 clones covering 93.7% of euchromatic regions. Clones were selected primarily from the published "Golden Path," and mapping was confirmed by fingerprinting and BAC-end sequencing. Array performance was extensively tested by a series of validation assays. These included determining the hybridization characteristics of each individual clone on the array by chromosome-specific add-in experiments. Estimation of data reproducibility and false-positive/negative rates was carried out using self-self hybridizations, replicate experiments, and independent validations of CNVs. Based on these studies, we developed a variance-based automatic copy number detection analysis process (CNVfinder) and have demonstrated its robustness by comparison with the SW-ARRAY method. [Abstract/Link to Full Text]

Komura D, Shen F, Ishikawa S, Fitch KR, Chen W, Zhang J, Liu G, Ihara S, Nakamura H, Hurles ME, Lee C, Scherer SW, Jones KW, Shapero MH, Huang J, Aburatani H
Genome-wide detection of human copy number variations using high-density DNA oligonucleotide arrays.
Genome Res. 2006 Dec;16(12):1575-84.
Recent reports indicate that copy number variations (CNVs) within the human genome contribute to nucleotide diversity to a larger extent than single nucleotide polymorphisms (SNPs). In addition, the contribution of CNVs to human disease susceptibility may be greater than previously expected, although a complete understanding of the phenotypic consequences of CNVs is incomplete. We have recently reported a comprehensive view of CNVs among 270 HapMap samples using high-density SNP genotyping arrays and BAC array CGH. In this report, we describe a novel algorithm using Affymetrix GeneChip Human Mapping 500K Early Access (500K EA) arrays that identified 1203 CNVs ranging in size from 960 bp to 3.4 Mb. The algorithm consists of three steps: (1) Intensity pre-processing to improve the resolution between pairwise comparisons by directly estimating the allele-specific affinity as well as to reduce signal noise by incorporating probe and target sequence characteristics via an improved version of the Genomic Imbalance Map (GIM) algorithm; (2) CNV extraction using an adapted SW-ARRAY procedure to automatically and robustly detect candidate CNV regions; and (3) copy number inference in which all pairwise comparisons are summarized to more precisely define CNV boundaries and accurately estimate CNV copy number. Independent testing of a subset of CNVs by quantitative PCR and mass spectrometry demonstrated a >90% verification rate. The use of high-resolution oligonucleotide arrays relative to other methods may allow more precise boundary information to be extracted, thereby enabling a more accurate analysis of the relationship between CNVs and other genomic features. [Abstract/Link to Full Text]

Emanuelsson O, Nagalakshmi U, Zheng D, Rozowsky JS, Urban AE, Du J, Lian Z, Stolc V, Weissman S, Snyder M, Gerstein MB
Assessing the performance of different high-density tiling microarray strategies for mapping transcribed regions of the human genome.
Genome Res. 2007 Jun;17(6):886-97.
Genomic tiling microarrays have become a popular tool for interrogating the transcriptional activity of large regions of the genome in an unbiased fashion. There are several key parameters associated with each tiling experiment (e.g., experimental protocols and genomic tiling density). Here, we assess the role of these parameters as they are manifest in different tiling-array platforms used for transcription mapping. First, we analyze how a number of published tiling-array experiments agree with established gene annotation on human chromosome 22. We observe that the transcription detected from high-density arrays correlates substantially better with annotation than that from other array types. Next, we analyze the transcription-mapping performance of the two main high-density oligonucleotide array platforms in the ENCODE regions of the human genome. We hybridize identical biological samples and develop several ways of scoring the arrays and segmenting the genome into transcribed and nontranscribed regions, with the aim of making the platforms most comparable to each other. Finally, we develop a platform comparison approach based on agreement with known annotation. Overall, we find that the performance improves with more data points per locus, coupled with statistical scoring approaches that properly take advantage of this, where this larger number of data points arises from higher genomic tiling density and the use of replicate arrays and mismatches. While we do find significant differences in the performance of the two high-density platforms, we also find that they complement each other to some extent. Finally, our experiments reveal a significant amount of novel transcription outside of known genes, and an appreciable sample of this was validated by independent experiments. [Abstract/Link to Full Text]

Coulombe-Huntington J, Majewski J
Characterization of intron loss events in mammals.
Genome Res. 2007 Jan;17(1):23-32.
The exon/intron structure of eukaryotic genes differs extensively across species, but the mechanisms and relative rates of intron loss and gain are still poorly understood. Here, we used whole-genome sequence alignments of human, mouse, rat, and dog to perform a genome-wide analysis of intron loss and gain events in >17,000 mammalian genes. We found no evidence for intron gain and 122 cases of intron loss, most of which occurred within the rodent lineage. The majority (68%) of the deleted introns were extremely small (<150 bp), significantly smaller than average. The intron losses occurred almost exclusively within highly expressed, housekeeping genes, supporting the hypothesis that intron loss is mediated via germline recombination of genomic DNA with intronless cDNA. This study constitutes the largest scale analysis for intron dynamics in vertebrates to date and allows us to confirm and extend several hypotheses previously based on much smaller samples. Our results in mammals show that intron gain has not been a factor in the evolution of gene structure during the past 95 Myr and has likely been restricted to more ancient history. [Abstract/Link to Full Text]

Emrich SJ, Barbazuk WB, Li L, Schnable PS
Gene discovery and annotation using LCM-454 transcriptome sequencing.
Genome Res. 2007 Jan;17(1):69-73.
454 DNA sequencing technology achieves significant throughput relative to traditional approaches. More than 261,000 ESTs were generated by 454 Life Sciences from cDNA isolated using laser capture microdissection (LCM) from the developmentally important shoot apical meristem (SAM) of maize (Zea mays L.). This single sequencing run annotated >25,000 maize genomic sequences and also captured approximately 400 expressed transcripts for which homologous sequences have not yet been identified in other species. Approximately 70% of the ESTs generated in this study had not been captured during a previous EST project conducted using a cDNA library constructed from hand-dissected apex tissue that is highly enriched for SAMs. In addition, at least 30% of the 454-ESTs do not align to any of the approximately 648,000 extant maize ESTs using conservative alignment criteria. These results indicate that the combination of LCM and the deep sequencing possible with 454 technology enriches for SAM transcripts not present in current EST collections. RT-PCR was used to validate the expression of 27 genes whose expression had been detected in the SAM via LCM-454 technology, but that lacked orthologs in GenBank. Significantly, transcripts from approximately 74% (20/27) of these validated SAM-expressed "orphans" were not detected in meristem-rich immature ears. We conclude that the coupling of LCM and 454 sequencing technologies facilitates the discovery of rare, possibly cell-type-specific transcripts. [Abstract/Link to Full Text]

Jehan Z, Vallinayagam S, Tiwari S, Pradhan S, Singh L, Suresh A, Reddy HM, Ahuja YR, Jesudasan RA
Novel noncoding RNA from human Y distal heterochromatic block (Yq12) generates testis-specific chimeric CDC2L2.
Genome Res. 2007 Apr;17(4):433-40.
The human Y chromosome, because it is enriched in repetitive DNA, has been very intractable to genetic and molecular analyses. There is no previous evidence for developmental stage- and testis-specific transcription from the male-specific region of the Y (MSY). Here, we present evidence for the first time for a developmental stage- and testis-specific transcription from MSY distal heterochromatic block. We isolated two novel RNAs, which localize to Yq12 in multiple copies, show testis-specific expression, and lack active X-homologs. Experimental evidence shows that one of the above Yq12 noncoding RNAs (ncRNAs) trans-splices with CDC2L2 mRNA from chromosome 1p36.3 locus to generate a testis-specific chimeric beta sv13 isoform. This 67-nt 5'UTR provided by the Yq12 transcript contains within it a Y box protein-binding CCAAT motif, indicating translational regulation of the beta sv13 isoform in testis. This is also the first report of trans-splicing between a Y chromosomal and an autosomal transcript. [Abstract/Link to Full Text]

Chen FC, Chen CJ, Li WH, Chuang TJ
Human-specific insertions and deletions inferred from mammalian genome sequences.
Genome Res. 2007 Jan;17(1):16-22.
It has been suggested that insertions and deletions (indels) have contributed to the sequence divergence between the human and chimpanzee genomes more than do nucleotide changes (3% vs. 1.2%). However, although there have been studies of large indels between the two genomes, no systematic analysis of small indels (i.e., indels </= 100 bp) has been published. In this study, we first estimated that the false-positive rate of small indels inferred from human-chimpanzee pairwise sequence alignments is quite high, suggesting that the chimpanzee genome draft is not sufficiently accurate for our purpose. We have therefore inferred only human-specific indels using multiple sequence alignments of mammalian genomes. We identified >840,000 "small" indels, which affect >7000 UCSC-annotated human genes (>11,000 transcripts). These indels, however, amount to only approximately 0.21% sequence change in the human lineage for the regions compared, whereas in pseudogenes indels contribute to a sequence divergence of 1.40%, suggesting that most of the indels that occurred in genic regions have been eliminated. Functional analysis reveals that the genes whose coding exons have been affected by human-specific indels are enriched in transcription and translation regulatory activities but are underrepresented in catalytic and transporter activities, cellular and physiological processes, and extracellular region/matrix. This functional bias suggests that human-specific indels might have contributed to human unique traits by causing changes at the RNA and protein level. [Abstract/Link to Full Text]

Eyheramendy S, Marchini J, McVean G, Myers S, Donnelly P
A model-based approach to capture genetic variation for future association studies.
Genome Res. 2007 Jan;17(1):88-95.
Genome-wide association studies are still constrained by the cost of genotyping. For this reason, the selection of a reduced set of markers or tags able to capture a significant proportion of the genetic variation is an important aspect of these studies. Most tagging SNP selection methods have been successful in capturing the genetic variation of the data from which the tags have been chosen. However, when these tags are used in an independent data set, a significant proportion of the remaining SNPs (non-tags) are not captured and, in most cases, there is no information on which SNPs are captured. We propose to use a probabilistic model to predict the non-tags based on a set of tags, as a way to capture genetic variation. An important advantage of this method is that it directly predicts the genotype of the non-tags with which we can test for association with the phenotype and which could help to elucidate the location of genes responsible for increasing disease susceptibility. Additionally, this method provides an estimate of the probabilities with which the predictions are made, which reflects the confidence of the probabilistic model. We also propose new methods to select the tagging SNPs. We empirically show by using HapMap data that our approach is able to capture significantly more genetic variation than methods based solely on a pairwise LD measure. [Abstract/Link to Full Text]

Didelot X, Achtman M, Parkhill J, Thomson NR, Falush D
A bimodal pattern of relatedness between the Salmonella Paratyphi A and Typhi genomes: convergence or divergence by homologous recombination?
Genome Res. 2007 Jan;17(1):61-8.
All Salmonella can cause disease but severe systemic infections are primarily caused by a few lineages. Paratyphi A and Typhi are the deadliest human restricted serovars, responsible for approximately 600,000 deaths per annum. We developed a Bayesian changepoint model that uses variation in the degree of nucleotide divergence along two genomes to detect homologous recombination between these strains, and with other lineages of Salmonella enterica. Paratyphi A and Typhi showed an atypical and surprising pattern. For three quarters of their genomes, they appear to be distantly related members of the species S. enterica, both in their gene content and nucleotide divergence. However, the remaining quarter is much more similar in both aspects, with average nucleotide divergence of 0.18% instead of 1.2%. We describe two different scenarios that could have led to this pattern, convergence and divergence, and conclude that the former is more likely based on a variety of criteria. The convergence scenario implies that, although Paratyphi A and Typhi were not especially close relatives within S. enterica, they have gone through a burst of recombination involving more than 100 recombination events. Several of the recombination events transferred novel genes in addition to homologous sequences, resulting in similar gene content in the two lineages. We propose that recombination between Typhi and Paratyphi A has allowed the exchange of gene variants that are important for their adaptation to their common ecological niche, the human host. [Abstract/Link to Full Text]

Gomes JP, Bruno WJ, Nunes A, Santos N, Florindo C, Borrego MJ, Dean D
Evolution of Chlamydia trachomatis diversity occurs by widespread interstrain recombination involving hotspots.
Genome Res. 2007 Jan;17(1):50-60.
Chlamydia trachomatis is an obligate intracellular bacterium of major public health significance, infecting over one-tenth of the world's population and causing blindness and infertility in millions. Mounting evidence supports recombination as a key source of genetic diversity among free-living bacteria. Previous research shows that intracellular bacteria such as Chlamydiaceae may also undergo recombination but whether this plays a significant evolutionary role has not been determined. Here, we examine multiple loci dispersed throughout the chromosome to determine the extent and significance of recombination among 19 laboratory reference strains and 10 present-day ocular and urogenital clinical isolates using phylogenetic reconstructions, compatibility matrices, and statistically based recombination programs. Recombination is widespread; all clinical isolates are recombinant at multiple loci with no two belonging to the same clonal lineage. Several reference strains show nonconcordant phylogenies across loci; one strain is unambiguously identified as recombinantly derived from other reference strain lineages. Frequent recombination contrasts with a low level of point substitution; novel substitutions relative to reference strains occur less than one per kilobase. Hotspots for recombination are identified downstream from ompA, which encodes the major outer membrane protein. This widespread recombination, unexpected for an intracellular bacterium, explains why strain-typing using one or two genes, such as ompA, does not correlate with clinical phenotypes. Our results do not point to specific events that are responsible for different pathogenicities but, instead, suggest a new approach to dissect the genetic basis for clinical strain pathology with implications for evolution, host cell adaptation, and emergence of new chlamydial diseases. [Abstract/Link to Full Text]

Dewannieux M, Harper F, Richaud A, Letzelter C, Ribet D, Pierron G, Heidmann T
Identification of an infectious progenitor for the multiple-copy HERV-K human endogenous retroelements.
Genome Res. 2006 Dec;16(12):1548-56.
Human Endogenous Retroviruses are expected to be the remnants of ancestral infections of primates by active retroviruses that have thereafter been transmitted in a Mendelian fashion. Here, we derived in silico the sequence of the putative ancestral "progenitor" element of one of the most recently amplified family - the HERV-K family - and constructed it. This element, Phoenix, produces viral particles that disclose all of the structural and functional properties of a bona-fide retrovirus, can infect mammalian, including human, cells, and integrate with the exact signature of the presently found endogenous HERV-K progeny. We also show that this element amplifies via an extracellular pathway involving reinfection, at variance with the non-LTR-retrotransposons (LINEs, SINEs) or LTR-retrotransposons, thus recapitulating ex vivo the molecular events responsible for its dissemination in the host genomes. We also show that in vitro recombinations among present-day human HERV-K (also known as ERVK) loci can similarly generate functional HERV-K elements, indicating that human cells still have the potential to produce infectious retroviruses. [Abstract/Link to Full Text]

Rocha EP, Touchon M, Feil EJ
Similar compositional biases are caused by very different mutational effects.
Genome Res. 2006 Dec;16(12):1537-47.
Compositional replication strand bias, commonly referred to as GC skew, is present in many genomes of prokaryotes, eukaryotes, and viruses. Although cytosine deamination in ssDNA (resulting in C-->T changes on the leading strand) is often invoked as its major cause, the precise contributions of this and other substitution types are currently unknown. It is also unclear if the underlying mutational asymmetries are the same among taxa, are stable over time, or how closely the observed biases are to mutational equilibrium. We analyzed nearly neutral sites of seven taxa each with between three and six complete bacterial genomes, and inferred the substitution spectra of fourfold degenerate positions in nonhighly expressed genes. Using a bootstrap procedure, we extracted compositional biases associated with replication and identified the significant asymmetries. Although all taxa showed an overrepresentation of G relative to C on the leading strand (and imbalances between A and T), widely variable substitution asymmetries are noted. Surprisingly, all substitution types show significant asymmetry in at least one taxon, but none were universally biased in all taxa. Notably, in the two most biased genomes, A-->G, rather than C-->T, shapes the compositional bias. Given the variability in these biases, we propose that the process is multifactorial. Finally, we also find that most genomes are not at compositional equilibrium, and suggest that mutational-based heterotachy is deeply imprinted in the history of biological macromolecules. This shows that similar compositional biases associated with the same essential well-conserved process, replication, do not reflect similar mutational processes in different genomes, and that caution is required in inferring the roles of specific mutational biases on the basis of contemporary patterns of sequence composition. [Abstract/Link to Full Text]

Jones AK, Raymond-Delpech V, Thany SH, Gauthier M, Sattelle DB
The nicotinic acetylcholine receptor gene family of the honey bee, Apis mellifera.
Genome Res. 2006 Nov;16(11):1422-30.
Nicotinic acetylcholine receptors (nAChRs) mediate fast cholinergic synaptic transmission and play roles in many cognitive processes. They are under intense research as potential targets of drugs used to treat neurodegenerative diseases and neurological disorders such as Alzheimer's disease and schizophrenia. Invertebrate nAChRs are targets of anthelmintics as well as a major group of insecticides, the neonicotinoids. The honey bee, Apis mellifera, is one of the most beneficial insects worldwide, playing an important role in crop pollination, and is also a valuable model system for studies on social interaction, sensory processing, learning, and memory. We have used the A. mellifera genome information to characterize the complete honey bee nAChR gene family. Comparison with the fruit fly Drosophila melanogaster and the malaria mosquito Anopheles gambiae shows that the honey bee possesses the largest family of insect nAChR subunits to date (11 members). As with Drosophila and Anopheles, alternative splicing of conserved exons increases receptor diversity. Also, we show that in one honey bee nAChR subunit, six adenosine residues are targeted for RNA A-to-I editing, two of which are evolutionarily conserved in Drosophila melanogaster and Heliothis virescens orthologs, and that the extent of editing increases as the honey bee lifecycle progresses, serving to maximize receptor diversity at the adult stage. These findings on Apis mellifera enhance our understanding of nAChR functional genomics and provide a useful basis for the development of improved insecticides that spare a major beneficial insect species. [Abstract/Link to Full Text]

Cho S, Huang ZY, Green DR, Smith DR, Zhang J
Evolution of the complementary sex-determination gene of honey bees: balancing selection and trans-species polymorphisms.
Genome Res. 2006 Nov;16(11):1366-75.
The mechanism of sex determination varies substantively among evolutionary lineages. One important mode of genetic sex determination is haplodiploidy, which is used by approximately 20% of all animal species, including >200,000 species of the entire insect order Hymenoptera. In the honey bee Apis mellifera, a hymenopteran model organism, females are heterozygous at the csd (complementary sex determination) locus, whereas males are hemizygous (from unfertilized eggs). Fertilized homozygotes develop into sterile males that are eaten before maturity. Because homozygotes have zero fitness and because common alleles are more likely than rare ones to form homozygotes, csd should be subject to strong overdominant selection and negative frequency-dependent selection. Under these selective forces, together known as balancing selection, csd is expected to exhibit a high degree of intraspecific polymorphism, with long-lived alleles that may be even older than the species. Here we sequence the csd genes as well as randomly selected neutral genomic regions from individuals of three closely related species, A. mellifera, Apis cerana, and Apis dorsata. The polymorphic level is approximately seven times higher in csd than in the neutral regions. Gene genealogies reveal trans-species polymorphisms at csd but not at any neutral regions. Consistent with the prediction of rare-allele advantage, nonsynonymous mutations are found to be positively selected in csd only in early stages after their appearances. Surprisingly, three different hypervariable repetitive regions in csd are present in the three species, suggesting variable mechanisms underlying allelic specificities. Our results provide a definitive demonstration of balancing selection acting at the honey bee csd gene, offer insights into the molecular determinants of csd allelic specificities, and help avoid homozygosity in bee breeding. [Abstract/Link to Full Text]

Kaplan N, Linial M
ProtoBee: hierarchical classification and annotation of the honey bee proteome.
Genome Res. 2006 Nov;16(11):1431-8.
The recently sequenced genome of the honey bee (Apis mellifera) has produced 10,157 predicted protein sequences, calling for a computational effort to extract biological insights from them. We have applied an unsupervised hierarchical protein-clustering method, which was previously used in the ProtoNet system, to nearly 200,000 proteins consisting of the predicted honey bee proteins, the SWISS-PROT protein database, and the complete set of proteins of the mouse (Mus musculus) and the fruit fly (Drosophila melanogaster). The hierarchy produced by this method has been entitled ProtoBee. In ProtoBee, the proteins are hierarchically organized into 18,936 separate tree hierarchies, each representing a protein functional family. By using the mouse and Drosophila complete proteomes as reference, we are able to highlight functional groups of putative gene-loss events, putative novel proteins of unique functionality, and bee-specific paralogs. We have studied some of the ProtoBee findings and suggest their biological relevance. Examples include novel opsin genes and intriguing nuclear matches of mitochondrial genes. The organization of bee sequences into functional clusters suggests a natural way of automatically inferring functional annotation. Following this notion, we were able to assign functional annotation to about 70% of the sequences. ProtoBee is available at http://www.protobee.cs.huji.ac.il. [Abstract/Link to Full Text]

Drapeau MD, Albert S, Kucharski R, Prusko C, Maleszka R
Evolution of the Yellow/Major Royal Jelly Protein family and the emergence of social behavior in honey bees.
Genome Res. 2006 Nov;16(11):1385-94.
The genomic architecture underlying the evolution of insect social behavior is largely a mystery. Eusociality, defined by overlapping generations, parental brood care, and reproductive division of labor, has most commonly evolved in the Hymenopteran insects, including the honey bee Apis mellifera. In this species, the Major Royal Jelly Protein (MRJP) family is required for all major aspects of eusocial behavior. Here, using data obtained from the A. mellifera genome sequencing project, we demonstrate that the MRJP family is encoded by nine genes arranged in an approximately 60-kb tandem array. Furthermore, the MRJP protein family appears to have evolved from a single progenitor gene that encodes a member of the ancient Yellow protein family. Five genes encoding Yellow-family proteins flank the genomic region containing the genes encoding MRJPs. We describe the molecular evolution of these protein families. We then characterize developmental-stage-specific, sex-specific, and caste-specific expression patterns of the mrjp and yellow genes in the honey bee. We review empirical evidence concerning the functions of Yellow proteins in fruit flies and social ants, in order to shed light on the roles of both Yellow and MRJP proteins in A. mellifera. In total, the available evidence suggests that Yellows and MRJPs are multifunctional proteins with diverse, context-dependent physiological and developmental roles. However, many members of the Yellow/MRJP family act as facilitators of reproductive maturation. Finally, it appears that MRJP protein subfamily evolution from the Yellow protein family may have coincided with the evolution of honey bee eusociality. [Abstract/Link to Full Text]

Sutherland TD, Campbell PM, Weisman S, Trueman HE, Sriskantha A, Wanjura WJ, Haritos VS
A highly divergent gene cluster in honey bees encodes a novel silk family.
Genome Res. 2006 Nov;16(11):1414-21.
The pupal cocoon of the domesticated silk moth Bombyx mori is the best known and most extensively studied insect silk. It is not widely known that Apis mellifera larvae also produce silk. We have used a combination of genomic and proteomic techniques to identify four honey bee fiber genes (AmelFibroin1-4) and two silk-associated genes (AmelSA1 and 2). The four fiber genes are small, comprise a single exon each, and are clustered on a short genomic region where the open reading frames are GC-rich amid low GC intergenic regions. The genes encode similar proteins that are highly helical and predicted to form unusually tight coiled coils. Despite the similarity in size, structure, and composition of the encoded proteins, the genes have low primary sequence identity. We propose that the four fiber genes have arisen from gene duplication events but have subsequently diverged significantly. The silk-associated genes encode proteins likely to act as a glue (AmelSA1) and involved in silk processing (AmelSA2). Although the silks of honey bees and silkmoths both originate in larval labial glands, the silk proteins are completely different in their primary, secondary, and tertiary structures as well as the genomic arrangement of the genes encoding them. This implies independent evolutionary origins for these functionally related proteins. [Abstract/Link to Full Text]

Robertson HM, Wanner KW
The chemoreceptor superfamily in the honey bee, Apis mellifera: expansion of the odorant, but not gustatory, receptor family.
Genome Res. 2006 Nov;16(11):1395-403.
The honey bee genome sequence reveals a remarkable expansion of the insect odorant receptor (Or) family relative to the repertoires of the flies Drosophila melanogaster and Anopheles gambiae, which have 62 and 79 Ors respectively. A total of 170 Or genes were annotated in the bee, of which seven are pseudogenes. These constitute five bee-specific subfamilies in an insect Or family tree, one of which has expanded to a total of 157 genes encoding proteins with 15%-99% amino acid identity. Most of the Or genes are in tandem arrays, including one with 60 genes. This bee-specific expansion of the Or repertoire presumably underlies their remarkable olfactory abilities, including perception of several pheromone blends, kin recognition signals, and diverse floral odors. The number of Apis mellifera Ors is approximately equal to the number of glomeruli in the bee antennal lobe (160-170), consistent with a general one-receptor/one-neuron/one-glomerulus relationship. The bee genome encodes just 10 gustatory receptors (Grs) compared with the D. melanogaster and A. gambiae repertoires of 68 and 76 Grs, respectively. A lack of Gr gene family expansion primarily accounts for this difference. A nurturing hive environment and a mutualistic relationship with plants may explain the lack of Gr family expansion. The Or family is the most dramatic example of gene family expansion in the bee genome, and characterizing their caste- and sex-specific gene expression may provide clues to their specific roles in detection of pheromone, kin, and floral odors. [Abstract/Link to Full Text]

For�t S, Maleszka R
Function and evolution of a gene family encoding odorant binding-like proteins in a social insect, the honey bee (Apis mellifera).
Genome Res. 2006 Nov;16(11):1404-13.
The remarkable olfactory power of insect species is thought to be generated by a combinatorial action of two large protein families, G protein-coupled olfactory receptors (ORs) and odorant binding proteins (OBPs). In olfactory sensilla, OBPs deliver hydrophobic airborne molecules to ORs, but their expression in nonolfactory tissues suggests that they also may function as general carriers in other developmental and physiological processes. Here we used bioinformatic and experimental approaches to characterize the OBP-like gene family in a highly social insect, the Western honey bee. Comparison with other insects shows that the honey bee has the smallest set of these genes, consisting of only 21 OBPs. This number stands in stark contrast to the more than 70 OBPs in Anopheles gambiae and 51 in Drosophila melanogaster. In the honey bee as in the two dipterans, these genes are organized in clusters. We show that the evolution of their structure involved frequent intron losses. We describe a monophyletic subfamily of OBPs where the diversification of some amino acids appears to have been accelerated by positive selection. Expression profiling under a wide range of conditions shows that in the honey bee only nine OBPs are antenna-specific. The remaining genes are expressed either ubiquitously or are tightly regulated in specialized tissues or during development. These findings support the view that OBPs are not restricted to olfaction and are likely to be involved in broader physiological functions. [Abstract/Link to Full Text]

Robertson HM, Gordon KH
Canonical TTAGG-repeat telomeres and telomerase in the honey bee, Apis mellifera.
Genome Res. 2006 Nov;16(11):1345-51.
The draft assembly of the honey bee Apis mellifera genome sequence reveals that the 17 centromeric-distal telomeres are of a simple, shared, and canonical structure, with 3-4 kb of a unique subtelomeric sequence, followed by several kilobases of TTAGG or variant telomeric repeats. This simple subtelomeric structure differs from the centromeric-proximal telomeres on the short arms of the 15 acrocentric chromosomes, which are apparently composed primarily of the 176-bp AluI tandem repeat. This dichotomy between the distal and proximal telomeres may involve differential participation of the telomeres of the 15 acrocentric chromosomes in the Rabl configuration after mitosis and the chromosome bouquet in meiotic prophase I. As expected from the presence of canonical TTAGG telomeric repeats, we identified a candidate telomerase gene in the bee, as well as the silkmoth Bombyx mori and the flour beetle Tribolium castaneum. [Abstract/Link to Full Text]

Rubin EB, Shemesh Y, Cohen M, Elgavish S, Robertson HM, Bloch G
Molecular and phylogenetic analyses reveal mammalian-like clockwork in the honey bee (Apis mellifera) and shed new light on the molecular evolution of the circadian clock.
Genome Res. 2006 Nov;16(11):1352-65.
The circadian clock of the honey bee is implicated in ecologically relevant complex behaviors. These include time sensing, time-compensated sun-compass navigation, and social behaviors such as coordination of activity, dance language communication, and division of labor. The molecular underpinnings of the bee circadian clock are largely unknown. We show that clock gene structure and expression pattern in the honey bee are more similar to the mouse than to Drosophila. The honey bee genome does not encode an ortholog of Drosophila Timeless (Tim1), has only the mammalian type Cryptochrome (Cry-m), and has a single ortholog for each of the other canonical "clock genes." In foragers that typically have strong circadian rhythms, brain mRNA levels of amCry, but not amTim as in Drosophila, consistently oscillate with strong amplitude and a phase similar to amPeriod (amPer) under both light-dark and constant darkness illumination regimes. In contrast to Drosophila, the honey bee amCYC protein contains a transactivation domain and its brain transcript levels oscillate at virtually an anti-phase to amPer, as it does in the mouse. Phylogenetic analyses indicate that the basal insect lineage had both the mammalian and Drosophila types of Cry and Tim. Our results suggest that during evolution, Drosophila diverged from the ancestral insect clock and specialized in using a set of clock gene orthologs that was lost by both mammals and bees, which in turn converged and specialized in the other set. These findings illustrate a previously unappreciated diversity of insect clockwork and raise critical questions concerning the evolution and functional significance of species-specific variation in molecular clockwork. [Abstract/Link to Full Text]

Dearden PK, Wilson MJ, Sablan L, Osborne PW, Havler M, McNaughton E, Kimura K, Milshina NV, Hasselmann M, Gempe T, Schioett M, Brown SJ, Elsik CG, Holland PW, Kadowaki T, Beye M
Patterns of conservation and change in honey bee developmental genes.
Genome Res. 2006 Nov;16(11):1376-84.
The current insect genome sequencing projects provide an opportunity to extend studies of the evolution of developmental genes and pathways in insects. In this paper we examine the conservation and divergence of genes and developmental processes between Drosophila and the honey bee; two holometabolous insects whose lineages separated approximately 300 million years ago, by comparing the presence or absence of 308 Drosophila developmental genes in the honey bee. Through examination of the presence or absence of genes involved in conserved pathways (cell signaling, axis formation, segmentation and homeobox transcription factors), we find that the vast majority of genes are conserved. Some genes involved in these processes are, however, missing in the honey bee. We have also examined the orthology of Drosophila genes involved in processes that differ between the honey bee and Drosophila. Many of these genes are preserved in the honey bee despite the process in which they act in Drosophila being different or absent in the honey bee. Many of the missing genes in both situations appear to have arisen recently in the Drosophila lineage, have single known functions in Drosophila, and act early in developmental pathways, while those that are preserved have pleiotropic functions. An evolutionary interpretation of these data is that either genes with multiple functions in a common ancestor are more likely to be preserved in both insect lineages, or genes that are preserved throughout evolution are more likely to co-opt additional functions. [Abstract/Link to Full Text]

Savard J, Tautz D, Richards S, Weinstock GM, Gibbs RA, Werren JH, Tettelin H, Lercher MJ
Phylogenomic analysis reveals bees and wasps (Hymenoptera) at the base of the radiation of Holometabolous insects.
Genome Res. 2006 Nov;16(11):1334-8.
Comparative studies require knowledge of the evolutionary relationships between taxa. However, neither morphological nor paleontological data have been able to unequivocally resolve the major groups of holometabolous insects so far. Here, we utilize emerging genome projects to assemble and analyze a data set of 185 nuclear genes, resulting in a fully resolved phylogeny of the major insect model species. Contrary to the most widely accepted phylogenetic hypothesis, bees and wasps (Hymenoptera) are basal to the other major holometabolous orders, beetles (Coleoptera), moths (Lepidoptera), and flies (Diptera). We validate our results by meticulous examination of potential confounding factors. Phylogenomic approaches are thus able to resolve long-standing questions about the phylogeny of insects. [Abstract/Link to Full Text]

Recent Articles in Journal of Applied Genetics

Wyszy?ska-Koko J, Kury? J
A novel polymorphism in exon 1 of the porcine myogenin gene.
J Appl Genet. 2005;46(4):399-402.
Myogenin is a gene belonging to the MyoD family, which codes for the bHLH transcription factor playing a key role in myogenesis. It affects the processes of differentiation and maturation of myotubes during embryogenesis. Fragments of the porcine myogenin coding sequence and promoter region were amplified and subjected to MSSCP analysis. T-->C transition recognised by the MaeIII restriction enzyme in exon 1 was revealed, which appeared to be a silent mutation in the region of the transactivation domain. No other polymorphism was found either in the remaining coding sequence or the promoter region. [Abstract/Link to Full Text]

Kaminski S, Grzybowski G, Prusak B, Ru?? A
No incidence of DUMPS carriers in Polish dairy cattle.
J Appl Genet. 2005;46(4):395-7.
DUMPS (Deficiency of Uridine Monophosphate Synthase) is a hereditary recessive disorder in Holstein cattle causing early embryo mortality during its implantation in the uterus. The only way to avoid the economic losses is early detection of DUMPS carriers. Because American Holstein semen has been intensively imported to Poland since 1970, there was a risk that DUMPS could have spread in Polish dairy cattle. In our study, 2209 dairy cattle of the Polish Holstein breed have been screened by the DNA test. The dominant group was young bulls entering the testing program (1171) and proven bulls (781). They represented all sires entering Polish breeding programs between 1999 and 2003. Also, 257 sire dams were included in the screening program. No DUMPS carrier has been found. Our results then indicate that the population of dairy cattle reared in Poland is free from DUMPS. Because of the economical significance of the DUMPS mutation and its recessive mode of inheritance, attention has to be paid to any case of a bull having in his origin any known DUMPS carrier. Such a bull should be tested and if positive eliminated from the active population. Also, young bulls (testing bulls) should be screened for DUMPS if in their progeny a high incidence of embryo mortality is observed and their genealogy cannot exclude their relatedness to any DUMPS carriers. [Abstract/Link to Full Text]

Chanvijit K, Duangjinda M, Pattarajinda V, Reodecha C
Model comparison for genetic evaluation of milk yield in crossbred Holsteins in the tropics.
J Appl Genet. 2005;46(4):387-93.
The objective of this study was to compare models for appropriate genetic parameter estimation for milk yield (305-day) in crossbred Holsteins in the tropics, where only records from crossbred cows were available. Eleven models with different effects of contemporary group (CG) at calving (herd-year-season or herd-year-month as fixed, and herd-year-month as random), age at calving (as linear or quadratic covariates, age-class, and age-class x lactation), and dominance were considered. On-farm records from small herds (n < 50) were included or excluded to validate the parameter estimates. Average Information Restricted Maximum Likelihood (AIREML) and Best Linear Unbiased Prediction (BLUP) were used to estimate variance components and breeding values. R-square (R2) and standard error of heritability (h2) were used to determine the appropriate model. The estimates of heritability from most models ranged from 0.18 to 0.22. CG formation of herd-year-month as a random effect slightly lowered the additive genetic variance but considerably decreased the permanent environmental variance. The model with age-class x lactation gave better R2 than other age adjustments. The models including records from smallholders gave similar estimates of heritability and a lower standard error than the models excluding them. The estimate of dominance variance as a proportion of total variance was close to zero. The low ratio of dominance to additive genetic variance suggested that the inclusion of dominance effects in the model was unjustified. In conclusion, the model including the effects of herd-year-month, age-class x lactation, as well as additive genetic, permanent environmental and residual effects, was the most appropriate for genetic evaluation in crossbred Holsteins, where records from smallholders could be included. [Abstract/Link to Full Text]

Chakraborty A, Aranishi F, Iwatsuki Y
Molecular identification of hairtail species (Pisces: Trichiuridae) based on PCR-RFLP analysis of the mitochondrial 16S rRNA gene.
J Appl Genet. 2005;46(4):381-5.
A rapid PCR-RFLP analysis was designed to identify 3 closely related species of hairtails: Trichiurus lepturus, T. japonicus, and Trichiurus sp. 2, basing on partial sequence data (600 bp) of the mitochondrial DNA encoding the 16S ribosomal RNA (16S rRNA) gene. Restriction digestion analysis of the unpurified PCR products of these 3 species, using EcoRI and VspI endonucleases, generated reproducible species-specific restriction patterns showing 2 fragments (250 bp and 350 bp) for T. lepturus in EcoRI digestion and 2 fragments (196 bp and 404 bp) for T. japonicus in VspI digestion, whereas no cleavage was observed for Trichiurus sp. 2 in both EcoRI and VspI digestions. The PCR-RFLP technique developed in this study proved to be a rapid, reliable and simple method that enables easy and accurate identification of these 3 closely related species of the genus Trichiurus. [Abstract/Link to Full Text]

Sud S, Bains NS, Nanda GS
Genetic relationships among wheat genotypes, as revealed by microsatellite markers and pedigree analysis.
J Appl Genet. 2005;46(4):375-9.
Genetic relationships among 20 elite wheat genotypes were studied using microsatellite markers and pedigree analysis. A total of 93 polymorphic bands were obtained with 25 microsatellite primer pairs. Coefficient of parentage (COP) values were calculated using parentage information at the expansion level of 5. The pedigree-based similarity (mean 0.115, range 0.00-0.53) was lower than the similarity assessed using microsatellite markers (mean 0.70, range 0.47-0.91). Similarity estimates were used to construct dendrograms by using the unweighted pair-group method with arithmetic averages (UPGMA). Clustering of genotypes in respect of marker-based similarity revealed two groups. Genotype PBW442 diverged and appeared as distinct from all other genotypes in both marker-based and pedigree-based analysis. The correlation of COP values with genetic similarity values based on microsatellite markers is low (r = 0.285, p < 0.05). The results indicate a need to develop wheat varieties with a diverse genetic background and to incorporate new variability into the existing wheat gene pool. [Abstract/Link to Full Text]

Stoja?owski S, Jaciubek M, Masoj? P
Rye SCAR markers for male fertility restoration in the P cytoplasm are also applicable to marker-assisted selection in the C cytoplasm.
J Appl Genet. 2005;46(4):371-3.
The study aimed at testing the usefulness of recently developed SCAR markers on rye (Secale cereale L.) chromosome 4R in hybrid breeding based on the C source of male sterility-inducing cytoplasm. Of 10 markers studied, 4 revealed polymorphisms between 2 inbred lines (544cms-C and Ot0-20) crossed to develop F2 and BC1 mapping populations. Analyses performed on 94 F2 and 93 BC1 plants allowed to extend a formerly constructed genetic map of chromosome arm 4RL. Three SCAR markers (SCP14M55, SCP15M55 and SCP16M58) were mapped in the vicinity of gene Rfc1, which restores male fertility in the C cytoplasm. The 3 tested SCAR markers proved to be effective in marker-assisted selection (MAS) for male fertility/sterility. [Abstract/Link to Full Text]

Wang HY, Liu DC, Yan ZH, Wei YM, Zheng YL
Cytological characteristics of F2 hybrids between Triticum aestivum L. and T. durum Desf. with reference to wheat breeding.
J Appl Genet. 2005;46(4):365-9.
Cytological and agronomic characteristics of a F2 population from Triticum aestivum L. x T. durum Desf. hybrids were analyzed plant by plant. Means of morphologic traits in the F2 population were similar to those of the low-value parent. On average, F2 hybrids had 36.54 chromosomes per plant, indicating that each gamete lost 2.73 chromosomes at meiosis of the F1 generation. More than half of plants had 36-39 chromosomes, so male gametes with 19-21 chromosomes seemed to be superior to the others. The distribution frequency of chromosomes in this study differed from that in a previous report, where a different tetraploid wheat was used. This shows that a different breeding strategy may need to be taken when exploiting a different tetraploid wheat. According to our results, some plants with 42 chromosomes, having all the wheat A, B and D chromosomes, would appear in the F3 population, which provides a chance to obtain stable bread wheat lines from the self-pollinated progenies. Alternatively, the desirable individuals of the F2 population were backcrossed to bread wheat, which is very useful and efficient for the improvement of bread wheat by exploiting desirable genes in durum wheat. [Abstract/Link to Full Text]

Blaszczyk L, Tyrka M, Che?kowski J
PstIAFLP based markers for leaf rust resistance genes in common wheat.
J Appl Genet. 2005;46(4):357-64.
The aim of the present study was to detect candidate DNA markers for selected leaf rust resistance genes. A total number of 286 loci in the 'Thatcher' near-isogenic lines carrying resistance gene Lr1, Lr9, Lr10, Lr13, Lr19, Lr21, Lr24, Lr26, Lr28, Lr35, and Lr37 were screened for DNA polymorphism by the PstIAFLP method. A survey with 33 selective primers yielded 16 candidate markers. Further validation studies on cultivars characterized for the presence and absence of selected resistance genes confirmed specificity of markers for Lr24, Lr26 and Lr37. The AFLP-based marker P42-530 was successfully converted into an STS marker. The new marker was linked with the Lr37-specific marker (CslVrga13) at the distance of 1.7 cM. The PstIAFLP method was found to be effective in the identification of DNA changes induced in hexaploid wheat by translocations from Agropyron elongatum, Secale cereale and Aegilops ventricosa. [Abstract/Link to Full Text]

Yue YW, Long H, Liu Q, Wei YM, Yan ZH, Zheng YL
Isolation of low-molecular-weight glutenin subunit genes from wild emmer wheat (Triticum dicoccoides).
J Appl Genet. 2005;46(4):349-55.
Three low-molecular-weight glutenin subunit (LMW-GS) genes, designated LMW-Td1, LMW-Td2 and LMW-Td3, were isolated from wild emmer wheat (Triticum dicoccoides), which is the tetraploid progenitor of common wheat (T. aestivum). The complete nucleotide sequence lengths of LMW-Td1, LMW-Td2 and LMW-Td3 are 858, 900 and 1062 bp, respectively. LMW-Td1 and LMW-Td3 can encode proteins with 284 and 352 amino acid residues, respectively, whereas LMW-Td2 is a putative pseudogene due to the presence of 3 inframe stop codons in its C-terminal domain. The deduced protein sequences of the 3 genes share the same typical polypeptide structures with known LMW-GS genes containing 8 cysteines in the mature protein domains. LMW-Td1 was clearly distinguished from all known LMW-GS genes, and considered as a novel LMW-GS gene. Two hydrophobic motifs (i.e. PIIIL and PVIIL) were observed in the repetitive domain of LMW-Td3. Sequence comparison indicates that sequences of the 3 LMW-GS genes from this study are strongly similar to known LMW-GS genes. Our phylogenetic analysis suggests that LMW-Td1 and LMW-Td2 are homologous with genes on chromosome 1A, and LMW-Td3 is closely related to genes on chromosome 1B. [Abstract/Link to Full Text]

Latos-Biele?ska A, Materna-Kiryluk A
Polish Registry of Congenital Malformations - aims and organization of the registry monitoring 300 000 births a year.
J Appl Genet. 2005;46(4):341-8.
In 1997, the Polish Registry of Congenital Malformations (PRCM) was established, to fulfil epidemiological, prophylactic, socioeconomic and scientific functions. The PRCM is a population-based registry monitoring currently about 300 000 births a year in 13 provinces. Such a large area and population require a special organizational structure of the Registry. The PRCM Central Working Group and the computer database are located in the Department of Medical Genetics, University of Medical Sciences, Pozna?. Here the data are collected, validated, encoded according to the ICD-10, and analysed. Provincial Working Groups are responsible for supervision of data collection in the given province. The PRCM staff has grown from about 250 members in 1997 to more than 400 members today. The PRCM collects information on structural defects diagnosed before the end of the second year of life. Minor anomalies are excluded from the registry. The main source of information is a registration form filled up by the physician diagnosing the anomaly. Since 2004 also electronic reporting has been possible. On 28 September 2005 there were 54 020 entries in the database concerning 33 729 children with at least one congenital malformation and 1261 control entries concerning children without malformations. The PRCM is also an important source of identification of families at genetic risk. Education of physicians and the community in the field of genetic counselling is also an important aim of the PRCM. Since 2001, the PRCM has been a member of the Eurocat. Detailed information on PRCM organization, electronic reporting, and results are available at the PRCM website (www.rejestrwad.pl). [Abstract/Link to Full Text]

?ugowska A, Szyma?ska K, Kmiec T, Tarczy?ska I, Czartoryska B, Tylki-Szyma?ska A, Jurkiewicz E
Homozygote for mutation c.1204 + 1G > A of the ARSA gene presents with a late-infantile form of metachromatic leukodystrophy and a rare MRI white matter lesion type.
J Appl Genet. 2005;46(3):337-9.
The metachromatic leukodystrophy (MLD)--causing mutation c.1204 + 1G > A damages an intron-exon splice site recognition sequence. This results in a complete loss of enzymatic activity of arylsulfatase A (ARSA) protein molecules. We have found a late-infantile type MLD-patient to be homozygous for this mutation, which was not reported earlier, but is consistent with previous suggestions. Interestingly, the cerebral magnetic resonance imaging (MRI) in this patient displayed linear or punctuate structures radiating in the demyelinated white matter, which resembled the patterns described in Pelizaeus-Merzbacher disease. It should be emphasised that whenever a cerebral MRI demonstrates the "tigroid" or "leopard-skin" demyelination pattern not only Pelizaeus-Merzbacher disease, but also metachromatic leukodystrophy diagnosis should be considered; this suggests the necessity of ARSA activity estimations in patients with such specific MRI patterns. [Abstract/Link to Full Text]

Srebniak M, Popowska L, Wawrzkiewicz-Witkowska A, Tomaszewska A, Kazmierczak W
Subfertile couple with t(4;22)(q23;q11.2).
J Appl Genet. 2005;46(3):333-6.
A couple was referred for cytogenetic examination due to idiopathic miscarriages. The proband proved to be a carrier of chromosomal translocation and her partner's karyotype was found to be normal. The karyotype of the proband is 46,XX,t(4;22)(q23;q11.2) and can be regarded as a reason of fertility problems in the investigated couple. The risk of further miscarriages is high, but the risk of a progeny with abnormal karyotype is rather low, as the progeny would probably have lethal imbalances. [Abstract/Link to Full Text]

Adler G, Widecka K, Peczkowska M, Dobrucki T, Placha G, Drozd R, Parczewski M, Januszewicz A, Gaciong Z, Ciechanowicz A
Genetic screening for glucocorticoid-remediable aldosteronism (GRA): experience of three clinical centres in Poland.
J Appl Genet. 2005;46(3):329-32.
Glucocorticoid-remediable aldosteronism (GRA), also known as familial hyperaldosteronism type I (FH-I, OMIM 103900), is a monogenic form of inherited hypertension caused by the presence of a chimaeric gene originating from an unequal cross-over between the CYP11B1 (11beta-hydroxylase) and CYP11B2 (aldosterone synthase) genes. The hybrid gene has the CYP11B1 sequence at the 5' end, including the promoter, and the CYP11B2 sequence at the 3' end. The aim of our study was to evaluate the prevalence of GRA in a Polish population of 129 patients with primary hyperaldosteronism (PHA) and 132 patients with essential hypertension (EH), through the use of a PCR-based test revealing the chimaeric gene. None of our PHA or EH patients was positive for the CYP11B1/CYP11B2 chimaeric gene. These data suggest that GRA is unlikely to be a common cause of hypertension in Polish subjects. However, the real prevalence of GRA in Poland, both in the high-risk group of individuals with primary hyperaldosteronism and in the general population, remains to be established. [Abstract/Link to Full Text]

Bauer PO, Matoska V, Zumrova A, Boday A, Doi H, Marikova T, Goetz P
Genotype/phenotype correlation in a SCA1 family: anticipation without CAG expansion.
J Appl Genet. 2005;46(3):325-8.
We report on a family with spinocerebellar ataxia type 1 (SCA1), in which the age at onset and the severity of the disease do not correlate with the number of CAG repeat units. Although a marked anticipation was observed in the proband, it was not a consequence of an expansion of the CAG tract. None of the expanded alleles contained CAT interruptions. The pathologic expansion in this family was stable during the paternal but not maternal transmission, where it expanded by one trinucleotide and unexpectedly did not lead to anticipation. Our observations suggest that factors other than the length of the CAG repeat play a considerable role in determination of the disease course. [Abstract/Link to Full Text]

Karpi?ski TM, Kostrzewska-Poczekaj M, Stachecki I, Mikstacki A, Szyfter K
Genotoxicity of the volatile anaesthetic desflurane in human lymphocytes in vitro, established by comet assay.
J Appl Genet. 2005;46(3):319-24.
The aim of the present study was to estimate the genotoxicity of desflurane, applied as a volatile anaesthetic. The potential genotoxicity was determined by the comet assay as the extent of DNA fragmentation in human peripheral blood lymphocytes in vitro. The comet assay detects DNA strand breaks induced directly by genotoxic agents as well as DNA fragmentation due to cell death. Another anaesthetic, halothane, already proved to be a genotoxic agent, was used as a positive control. Both analysed drugs were capable of increasing DNA migration in a dose-dependent manner under experimental conditions applied. The results of the study demonstrated that the genotoxicity of desflurane was comparable with that of halothane. However, considering the pharmacodynamics of both drugs, the genotoxic activity of desflurane may be connected with a less harmful effect on the exposed patients or medical staff. [Abstract/Link to Full Text]

Dybus A, Knapik K
A new PCR-RFLP within the domestic pigeon (Columba livia var. domestica) cytochrome b (MTCYB) gene.
J Appl Genet. 2005;46(3):315-7.
A total of 244 domestic pigeons (Columba livia var. domestica) were genotyped using the PCR-RFLP method. A 999 bp fragment of the MTCYB gene was amplified. The amplification products were digested with restriction enzymes. PCR-RFLP for MvaI restriction enzyme was observed. Frequencies of alleles were as follows: MTCYB(C)--0.926, MTCYB(G)-- 0.074. The frequencies of MTCYB/MvaI alleles found in this study for non-homing pigeons considerably deviate from the values found for homing/racing pigeons (allele MTCYB(G) occurred only in the non-homing breeds). [Abstract/Link to Full Text]

Gruszczy?ska J, Brokowska K, Charon KM, Swiderek WP
Restriction fragment length polymorphism of exon 2 Ovar-DRB1 gene in Polish Heath Sheep and Polish Lowland Sheep.
J Appl Genet. 2005;46(3):311-4.
Exon 2 of the Ovar-DR gene is known to encode the MHC outer domain (alpha or beta chain) that forms the binding area to antigens presented. The study was aimed at analysing exon 2 Ovar -DRB1 gene polymorphism in Polish Heath Sheep and Polish Lowland Sheep (Zelazna variety). A total of 101 and 99 ewes of the respective breeds were included in this study. We identified 65 different haplotypes in Polish Heath Sheep and 68 in Polish Lowland Sheep. The PCR-RFLP method and PCR products sequencing made it possible to identify two new sequences of exon 2 Ovar-DRB1 gene (AY230000 and AY248695). A distinct polymorphism in the exon 2 sequence presents possibilities for immune response toward a great variety of pathogens. [Abstract/Link to Full Text]

Yilmaz A, Davis ME, Hines HCh, Chung H
Detection of two nucleotide substitutions and putative promoters in the 5' flanking region of the ovine IGF-I gene.
J Appl Genet. 2005;46(3):307-9.
The objective of this study was to search for polymorphisms and gene regulatory sequences in the 5' flanking region of the sheep insulin-like growth factor I (IGF-I) gene. PCR-SSCP analysis of the 5' flanking region revealed three banding patterns. Family study indicated that these patterns in mixed breed sheep corresponded with three genotypes (with their frequencies in parentheses) AA (0.70), AB (0.25), and BB (0.05), which arose from a one-locus, two allele (A, B) polymorphism. Genotypic frequencies in 22 purebred Polypay sheep were AA (0.77) and AB (0.23). Calculated frequency of the A allele in Polypays was 0.89. No deviation from Hardy-Weinberg equilibrium was detected in this study. Fragments amplified using DNA from homozygous individuals were sequenced and aligned next to each other. A T to C transition and a G to C transversion were found at positions 179 and 181, respectively, of the amplified PCR product, resulting in recognition sites for Bsp143II and HaeI. Analysis of a fragment of 2,162 base pairs upstream of Exon 1, assembled from sheep ESTs and sequence of our amplified PCR products, revealed a promoter sequence approximately 100 bp downstream of the polymorphic sites. The assembled DNA fragment shared 70% sequence homology between sheep and human. These results suggest that sequence of the 5' flanking region of IGF-I gene and location of the IGF-I promoters are similar in human and sheep. [Abstract/Link to Full Text]

Zabek T, Nogaj A, Radko A, Nogaj J, S?ota E
Genetic variation of Polish endangered Bi?goraj horses and two common horse breeds in microsatellite loci.
J Appl Genet. 2005;46(3):299-305.
Genetic variation of endangered Bi?goraj horses and two common Polish horse breeds was compared with the use of 12 microsatellite loci (AHT4, AHT5, ASB2, HMS2, HMS3, HMS6, HMS7, HTG4, HTG6, HTG7, HTG10, VHL20). Lower allelic diversity was detected in all investigated populations in comparison to other studies. Large differences in the frequencies of microsatellite alleles between Bi?goraj horses and two other horse breeds were discovered. In all polymorphic loci all investigated breeds were in the Hardy-Weinberg equilibrium. Mean Fis values and the results of a test for the presence of a recent bottleneck were non-significant in all studied populations. Comparable values of observed and expected gene diversity indicate no substantial loss of genetic variation in the Bi?goraj population and two other breeds. The lowest variability observed in the investigated group of Thoroughbred horses was confirmed. About 10% of genetic variation are explained by differences between breeds. Values of pairwise Fst and two measures of genetic distance demonstrated that Bi?goraj horses are distantly related to both common horse breeds. [Abstract/Link to Full Text]

Pradeep AR, Chatterjee SN, Nair CV
Genetic differentiation induced by selection in an inbred population of the silkworm Bombyx mori, revealed by RAPD and ISSR marker systems.
J Appl Genet. 2005;46(3):291-8.
Artificial selection has been widely utilized in breeding programmes concerning the commercially important silk-producing insect Bombyx mori. Selection increases the frequency of homozygotes and makes homozygous effects stronger. Molecular variation induced by selection in the inbred population of B. mori strain Nistari, was assessed in terms of genic differentiation by using a polymorphic profile generated by RAPD and ISSR marker systems. Artificial selection for longer larval duration (LLD) for 4 generations resulted in a significant prolongation of larval duration (F = 89.28; P = 5.14 x 10(-7)). The lines selected for shorter larval duration (SLD) were not significantly different from the control group. RAPD and ISSR primers generated polymorphic profiles when amplified with genomic DNA of individuals of LLD and SLD lines. Distinct markers specific to LLD individuals were observed from the 3rd generation and indicated selection-induced differentiation of allelic variants for longer larval duration. Both SLD and LLD were characterized by high gene diversity (h approximately equal to 0.197) and total heterozygosity (Ht > or =0.26), low homogeneity (chi-square test, p < 0.005) as well as a large coefficient of gene differentiation (Gst > or =0.42) but low gene flow (Nm < or =0.42). Genetic distance was the highest (0.824) between 3rd generations of SLD and LLD. High heterozygosity and prolonged larval duration substituted for shorter larval duration (the traditional trait of fitness) in the Nistari LLD larvae. [Abstract/Link to Full Text]

Song C, Gao B, Teng Y, Wang X, Wang Z, Li Q, Mi H, Jing R, Mao J
MspI polymorphisms in the 3rd intron of the swine POU1F1 gene and their associations with growth performance.
J Appl Genet. 2005;46(3):285-9.
The study aimed to compare MspI polymorphisms in the 3rd intron of the porcine gene encoding the pituitary-1 transcription factor (Pit-1, renamed as POU1F1) among 5 breeds and to determine the associations between its genotypes and growth performance in a commercial pig population by using the PCR-RFLP technique. Significant differences in genotypic and allelic frequencies were found between the meat-type and fat-type breeds (P < 0.05), and between miniature pigs and others (P < 0.05). No breed deviated from the Hardy-Weinberg equilibrium (verified by chi-square test). The general linear model analysis revealed that higher body weight on day 180 (BW180) and average daily gain (ADG) were significantly associated with POU1F1 DD genotype (P < 0.05). The differences in BW180 and ADG between DD pigs and both CD and CC pigs were significant (P < 0.05), and the DD pigs had a significantly higher body weight on day 45 (BW45) and on day 70 (BW70) than CC pigs (P < 0.05). All measured growth traits, except for body weight at birth (BWB), showed higher values in DD pigs. The D allele had a favorable positive effect on growth traits. Thus POU1F1 is a potential major gene or marker for growth traits. [Abstract/Link to Full Text]

Trivedi M, Dhawan OP, Tiwari RK, Sattar A
Genetic studies on collar rot resistance in opium poppy (Papaver somniferum L.).
J Appl Genet. 2005;46(3):279-84.
The collar rot disease has been reported recently and occurs at the 10-12-leaf stage of plants of opium poppy. Infected plants topple down and dry prematurely due to fast rotting at the collar region. The inoculum for this study was multiplied on the cornmeal-sand culture. Genetic ratios were calculated by the chi-square test. Inheritance studies on this disease show a monogenic pattern of segregation with the ratio of 3 : 1 at F2, 1 : 2 : 1 at F3 and 1 : 1 at the backcross. Such genetic ratios clearly indicate that a single recessive gene (rs-1) is responsible for disease resistance in opium poppy. The inference drawn on the basis of the present study will be a great help in the future breeding programme of opium poppy for collar rot resistance. [Abstract/Link to Full Text]

Krzakowa M, Matras J
Genetic variability among beech (Fagus sylvatica L.) populations from the Sudety Mountains, in respect of peroxidase and malate dehydrogenase loci.
J Appl Genet. 2005;46(3):271-7.
Individual trees growing in five populations of European beech (Fagus sylvatica L.) in the Sudety Mountains were investigated in respect of variability of peroxidases (2 loci) and malate dehydrogenase (1 locus). Differences between populations were illustrated by a dendrogram constructed on the basis of Hedrick's (1974) genetic distances. The mean GST coefficient (=0.0333) value demonstrated the higher level of intra-population variability, as compared to the inter-population (DST = 0.0149) variability. [Abstract/Link to Full Text]

G�recka K, Krzyzanowska D, G�recki R
The influence of several factors on the efficiency of androgenesis in carrot.
J Appl Genet. 2005;46(3):265-9.
The influence of cultivar, donor plant and culture procedure on the efficiency of androgenesis was studied in carrot anther culture. Experiments were carried out on five carrot cultivars: CxC 9900 F1, Lucky B F1, HCM, Beta III and Perfekcja, which were chosen because of their high carotene contents. Two procedures of anther culture were compared: (1) incubation in darkness for two weeks, followed by exposure to continuous light and transfer onto a fresh medium of the same composition; and (2) incubation in darkness until embryos appeared, without transfer onto a fresh medium. Temperature was +27 degrees C all the time. Genotype played an important role in the process of androgenesis in carrot anther culture.The efficiency was the highest in cv. HCM - 5.6 embryos per 100 anthers. Considerable differences in the capacity for androgenesis were observed between individual donor plants. The ratio of embryos obtained per 100 anthers for cv. HCM varied from 0.0 to 48.9. The second procedure of anther culture proved to be more efficient, cheaper and less complicated. [Abstract/Link to Full Text]

Khanna R, Bansal UK, Saini RG
Genetics of durable resistance to leaf rust and stripe rust of an Indian wheat cultivar HD2009.
J Appl Genet. 2005;46(3):259-63.
The Indian bread wheat cultivar HD2009 has maintained its partial resistance to leaf rust and stripe rust in India since its release in 1976. To examine the nature, number and mode of inheritance of its genes for partial leaf rust and stripe rust resistance, this cultivar was crossed with cultivar WL711, which is susceptible to leaf rust and stripe rust. The F1, F2, F3 and F5 generations from this cross were assessed separately for adult plant disease severity under artificial epidemic of race 77-5 of leaf rust and race 46S119 of stripe rust. Segregation for rust reaction in the F2, F3 and F5 generations indicated that resistance to each of these rust diseases is based on 2 genes, each with additive effects. Although the leaf rust resistance of HD2009 is similar in expression to that conferred by the gene Lr34, but unlike the wheats carrying this gene, cultivar HD2009 did not show leaf tip necrosis, a morphological marker believed to be tightly linked to the leaf rust resistance gene Lr34. Thus, the non-hypersensitive resistance of HD2009 was ascribed to genes other than Lr34. [Abstract/Link to Full Text]

Rad?owski M
Proteolytic enzymes from generative organs of flowering plants (Angiospermae).
J Appl Genet. 2005;46(3):247-57.
Pollen proteases were discovered over 100 years ago, whereas the enzymes from female tissues have been used since the Roman era in simple biotechnological processes. In the last decade a great progress has been made in studies on plant proteases, including those from the generative organs. This paper reviews reports published in the last decade, concerning purification, properties and localization of proteases from generative parts of flowering plants against the background of the general proteolytic machinery of the plant. Special attention is paid to differences in protease structure and properties in comparison to other enzymes from the same catalytic classes. Participation of the proteases in all steps of pollen-pistil interaction as well as in pollen tube growth is discussed. Further intensive studies with use of native substrates are necessary to understand the role of proteases in pollination. [Abstract/Link to Full Text]

Podg�rska B, Chec E, Ulanowska K, Wegrzyn G
Optimisation of the microbiological mutagenicity assay based on genetically modified Vibrio harveyi strains.
J Appl Genet. 2005;46(2):241-6.
Recently, we have developed a novel assay designed for detection of mutagenic pollution of the marine environment. This assay is based on the use of a series of genetically modified strains (named BB7, BB7M, BB7X and BB7XM) of a marine bacterium Vibrio harveyi. Sensitivity of the V. harveyi mutagenicity assay was found to be similar to, or even somewhat higher than, that of the commonly used Ames test. Subsequent studies indicated that this assay may be useful in assessment of mutagenic contamination of the marine environment. Nevertheless, we assumed that improvement of this assay is still possible, and thus we aimed to optimise its procedures. Here we present our research on the optimisation of the V. harveyi mutagenicity assay, which indicated that different tester strains used in this assay give the best results depending upon the experimental conditions employed. Incubation of bacteria in a buffer, rather than in a nutrient broth, containing a mutagen, increased the efficiency of the assay with BB7 and BB7M strains, but had a deleterious effect in the case of BB7X and BB7XM. The latter couple of strains revealed higher mutagenicity in the plate assay, as compared to the liquid medium assay. However, the opposite effect was observed for BB7 and BB7M. Low-dose (1 J m(-2)) UV irradiation, as well as 30 min incubation in 0.1 M CaCl2, had no significant effect on the efficiency of the assay when using BB7 and BB7M, whereas the number of mutagen-induced mutants of BB7X and BB7XM strains increased about two times under these conditions. Our previous experiments indicated that various tester strains revealed different sensitivity to particular mutagens. Thus, a series of strains should be used in the assay. Results presented in this report show that different conditions should be used for two pairs of the tester strains: BB7 and BB7M, and BB7X and BB7XM. [Abstract/Link to Full Text]

Su?ek A, Hoffman-Zacharska D, Krysa W, Szirkowiec W, Fidzia?ska E, Zaremba J
CAG repeat polymorphism in the androgen receptor (AR) gene of SBMA patients and a control group.
J Appl Genet. 2005;46(2):237-9.
Spinobulbar muscular atrophy (SBMA) is an X-linked form of motor neuron disease characterized by progressive atrophy of the muscles, dysphagia, dysarthria and mild androgen insensitivity. SBMA is caused by CAG repeat expansion in the androgen receptor gene. CAG repeat polymorphism was analysed in a Polish control group (n = 150) and patients suspected of SBMA (n = 60). Normal and abnormal ranges of CAG repeats were established in the control group and in 21 patients whose clinical diagnosis of SBMA was molecularly confirmed. The ranges are similar to those reported for other populations. [Abstract/Link to Full Text]

Gedrange T, B�ttner C, Schneider M, Oppitz R, Harzer W
Myosin heavy chain protein and gene expression in the masseter muscle of adult patients with distal or mesial malocclusion.
J Appl Genet. 2005;46(2):227-36.
The aim of this study was to determine the amount of myosin heavy chain (MyHC) proteins and MyHC mRNA in muscles of patients with different positions of the mandible. Ten adult patients for orthognathic surgery were divided into two groups: distal and mesial malocclusion. The mRNA expression of two MyHC isoforms of the anterior and posterior part of the right and left side of the human masseter muscle was analysed with a competitive RT-PCR assay. An exogenous template that includes oligonucleotide sequences specific for sarcomeric MyHC isoforms (1 and 2x) was constructed and utilized as competitor. Different isoforms of the MyHC protein were identified by Western blot analysis. In the total mRNA pool of the masseter muscle, the MyHC 1 mRNA level was 25.5 +/- 7.6% and the MyHC 2x mRNA was 2.5 +/- 1.2%. The anterior part of the masseter muscle from patients with distal occlusion contained more type 1 and 2x MyHC mRNA, as compared to patients with mesial occlusion (P < 0.05). No difference in the protein distribution was observed. The differences in mRNA expression may be caused by the enforced stress of the masticatory muscle in distal occlusion because of the disadvantageous pivot. [Abstract/Link to Full Text]

Cao W, Hunter R, Strnatka D, McQueen CA, Erickson RP
DNA constructs designed to produce short hairpin, interfering RNAs in transgenic mice sometimes show early lethality and an interferon response.
J Appl Genet. 2005;46(2):217-25.
Arylamine N-acetyltransferase (NAT) genes were targeted for inhibition using short hairpin RNA (shRNA) using two different RNA polymerase III promoters. Constructs were developed for NAT1 and NAT2, the endogenous mouse genes, and for human NAT1. There were fetal and neonatal deaths with these constructs, perhaps due in part to an interferon response as reflected in increases in oligoadenylate synthetase I mRNA levels. Seven out of 8 founders with the U6 promoter generated offspring but only 2 gave positive offspring. Out of 15 founders for H1 promoted constructs, only 4 had positive offspring. When transgenic lines were successfully established, the expression of the targeted genes was variable between animals and was not generally inhibitory. [Abstract/Link to Full Text]

Recent Articles in Genetics and Molecular Research

Okamoto HT, Soares CM, Pereira M
Comparative analyses of the structure of the 1,3-beta-glucan synthase gene in Paracoccidioides brasiliensis isolates.
Genet Mol Res. 2006;5(2):407-18.
The evolutionary origin and significance of spliceosomal introns have been the subject of many investigations. Two theories, "introns-early" theory and "introns-late" theory, have been proposed to explain the evolution of introns in eukaryotic genes. Intron position is generally conserved in paralogue and orthologue genes. Some introns occur at similar but not necessarily identical positions in homologous genes, which were separated by great evolutionary distances. This event can be explained by insertion, loss or movement of the intron over short distances. Intron loss and gain events are unique in evolution and can be useful as markers for phylogenetic analyses. The insertion of introns at an identical position suggests a common ancestor gene. Here we analyzed, using PCR and RT-PCR, the structure of the 1,3-beta-glucan synthase gene (FKS) in several clinical isolates of Paracoccidioides brasiliensis (Pb): isolates Pb 01, Pb 4940, Pb 8515, Pb 8311, Pb 8334, Pb 4268, Pb 1668, and Pb E. Our results showed that seven of the isolates examined showed identical structures concerning the position of introns in PbFKS1. PbFKS4940 showed the intron described at the 3' end and had lost that one at the 5' end. The presence of the PbFKS4940 transcript suggests that it could be a functional gene. These data suggest a divergent evolution for introns with regard to the 1,3-beta-glucan synthase gene in P. brasiliensis isolates. [Abstract/Link to Full Text]

Fernandez R, Pasaro E
Molecular analysis of an idic(Y)(qter -->p11.32::p11.32-->qter) chromosome from a female patient with a complex karyotype.
Genet Mol Res. 2006;5(2):399-406.
A female patient with a structurally abnormal idic(Y) (p11.32) chromosome was studied using fluorescence in situ hybridization and PCR to define the precise position of the breakpoint. The patient had a complex mosaic karyotype with eight cell lines and at least two morphologically distinct derivatives from the Y chromosome. The rearrangement was a result of a meiosis I exchange between sister chromatids at the pseudoautosomal region, followed by centromere misdivision at meiosis II. Due to instability of the dicentric Y chromosome, new cell lines later arose because of mitotic errors occurring during embryonic development. Physical examination revealed a normal female phenotype without genital ambiguity, a normal uterus and rudimentary gonads which were surgically removed. [Abstract/Link to Full Text]

Ar�oz HV, Torrado M, Barreiro C, Chertkoff L
A combination of five short tandem repeats of chromosome 15 significantly improves the identification of Prader-Willi syndrome etiology in the Argentinean population.
Genet Mol Res. 2006;5(2):390-8.
Prader-Willi syndrome (PWS) is a multisystemic disorder caused by the loss of expression of paternally transcribed genes in the PWS critical region of chromosome 15. Various molecular mechanisms are known to lead to PWS: deletion 15q11-q13 (75% of cases), maternal uniparental disomy (matUPD15) (23%) and imprinting defects (2%). FISH and microsatellite analysis are required to establish the molecular etiology, which is essential for appropriate genetic counseling and care management. We characterized an Argentinean population, using five microsatellite markers (D15S1035, D15S11, D15S113, GABRB3, D15S211) chosen to develop an appropriate cost-effective method to establish the parental origin of chromosome 15 in nondeleted PWS patients. The range of heterozygosity for these five microsatellites was 0.59 to 0.94. The average heterozygosity obtained for joint loci was 0.81. The parental origin of chromosome 15 was established by microsatellite analysis in 19 of 21 non-deleted PWS children. We also examined the origin of the matUPD15; as expected, most of disomies were due to a maternal meiosis I error. The molecular characterization of this set of five microsatellites with high heterozygosity and polymorphism information content improves the diagnostic algorithm of Argentinean PWS children, contributing significantly to adequate genetic counseling of such families. [Abstract/Link to Full Text]

Guo X, Xu G, Zhang Y, Wen X, Hu W, Fan L
Incongruent evolution of chromosomal size in rice.
Genet Mol Res. 2006;5(2):373-89.
To investigate genome size evolution, it is usually informative to compare closely related species that vary dramatically in genome size. A whole genome duplication (polyploidy) that occurred in rice (Oryza sativa) about 70 million years ago has been well documented based on current genome sequencing. The presence of three distinct duplicate blocks from the polyploidy, of which one duplicated segment in a block is intact (no sequencing gap) and less than half the length of its syntenic duplicate segment, provided an excellent opportunity for elucidating the causes of their size variation during the post-polyploid time. The results indicated that incongruent patterns (shrunken, balanced and inflated) of chromosomal size evolution occurred in the three duplicate blocks, spanning over 30 Mb among chromosomes 2, 3, 6, 7, and 10, with an average of 20.3% for each. DNA sequences of chromosomes 2 and 3 appeared to had become as short as about half of their initial sequence lengths, chromosomes 6 and 7 had remained basically balanced, and chromosome 10 had become dramatically enlarged (approximately 70%). The size difference between duplicate segments of rice was mainly caused by variations in non-repetitive DNA loss. Amplification of long terminal repeat retrotransposons also played an important role. Moreover, a relationship seems to exist between the chromosomal size differences and the nonhomologous combination in corresponding regions in the rice genome. These findings help shed light on the evolutionary mechanism of genomic sequence variation after polyploidy and genome size evolution. [Abstract/Link to Full Text]

Brancaleoni GH, Lourenzoni MR, Degr�ve L
Study of the influence of ethanol on basic fibroblast growth factor structure.
Genet Mol Res. 2006;5(2):350-72.
The growth of cells is controlled by stimulatory or inhibitory factors. More than twenty different families of polypeptide growth factors have been structurally and functionally characterized. Basic fibroblast growth factor (bFGF) of the fibroblast growth factor family was characterized in 1974 as having proliferative activity for fibroblastic cells. The inhibitory effects of ethanol on cell proliferation result from interference with mitogenic growth factors (e.g., bFGF, EGF and PDGF). In order to better understand the mode of action of bFGF, particularly regarding the influence of ethanol on the biological activity of bFGF, three recombinant bFGF mutants were produced (M6B-bFGF, M1-bFGF and M1Q-bFGF). In the present study, wild bFGF and these mutants were examined by molecular dynamics simulations in systems consisting of a solute molecule in ethanol solution at 298 K and physiological pH over 4.0 ns. The hydrogen bonds, the root mean square deviations and specific radial distribution functions were employed to identify changes in the hydrogen bond structures, in the stability and in the approximation of groups in the different peptides to get some insight into the biological role of specific bFGF regions. The detailed description of the intramolecular hydrogen bonds, hydration, and intermolecular hydrogen bonds taking place in bFGF and its mutants in the presence of ethanol established that the residues belonging to the beta5 and beta9 strands, especially SER-73(beta5), TYR-112(beta9), THR-114(beta9), TYR-115(beta9), and SER-117(beta9), are the regions most affected by the presence of ethanol molecules in solution. [Abstract/Link to Full Text]

Onrat ST, A?�i F, Ozkan M
A cytogenetics study of Hydrodroma despiciens (M�ller, 1776) (Acari: Hydrachnellae: Hydrodromidae).
Genet Mol Res. 2006;5(2):342-9.
The karyotypes of water mites (Acari: Hydrachnellae: Hydrodromidae) are largely unknown. The present investigation is the first report of a study designed to characterize the chromosomes of water mites. The study was carried out with specimens of Hydrodroma despiciens collected from Eber Lake in Afyon, Turkey. Several different methods were tried to obtain chromosomes of this species. However, somatic cell culture proved to be the most effective for the preparation of chromosomes. In the present study, we determined the diploid chromosome number of Hydrodroma despiciens to be 2n = 16. However, a large metacentric chromosome was found in each metaphase, which we believed to be the X chromosome. We could not determine the sex chromosomes of this species. This study is the first approach to the cytogenetic characterization of this water mite group. Furthermore, these cytogenetic data will contribute to the understanding of the phylogenetic relationship among water mites. To our knowledge, this is the first report on the cytogenetics of water mites. [Abstract/Link to Full Text]

Fileto R, Kuser PR, Yamagishi ME, Ribeiro AA, Quinalia TG, Franco EH, Mancini AL, Higa RH, Oliveira SR, Santos EH, Vieira FD, Mazoni I, Cruz SA, Neshich G
PDB-Metrics: a web tool for exploring the PDB contents.
Genet Mol Res. 2006;5(2):333-41.
PDB-Metrics (http://sms.cbi.cnptia.embrapa.br/SMS/pdb_metrics/index.html) is a component of the Diamond STING suite of programs for the analysis of protein sequence, structure and function. It summarizes the characteristics of the collection of protein structure descriptions deposited in the Protein Data Bank (PDB) and provides a Web interface to search and browse the PDB, using a variety of alternative criteria. PDB-Metrics is a powerful tool for bioinformaticians to examine the data span in the PDB from several perspectives. Although other Web sites offer some similar resources to explore the PDB contents, PDB-Metrics is among those with the most complete set of such facilities, integrated into a single Web site. This program has been developed using SQLite, a C library that provides all the query facilities of a database management system. [Abstract/Link to Full Text]

Mukhopadhyaya PN, Jha M, Muraleedharan P, Gupta RR, Rathod RN, Mehta HH, Khoda VK
Simulation of normal, carrier and affected controls for large-scale genotyping of cattle for factor XI deficiency.
Genet Mol Res. 2006;5(2):323-32.
An insertion mutation within exon 12 of the factor XI gene has been described in Holstein cattle. This has opened the prospect for large-scale screening of cattle using the polymerase chain reaction (PCR) technique for the rapid identification of heterozygous animals. To facilitate such a screening process, the mutant and normal alleles of factor XI gene, represented by 244- and 320-bp PCR amplified fragments, were individually cloned in Escherichia coli using a multicopy plasmid cloning vehicle to generate pFXI-N and pFXI-M, respectively. The authenticity of the inserts was confirmed by nucleotide sequencing. A nested PCR method was developed, by which PCR amplicons generated from primers with annealing sites on the recombinant plasmids and by flanking the insert were used as templates for amplification of the diagnostic products using factor XI gene-specific primers. An equimolar mixture of both PCR amplicons, originating from pFXI-N and pFXI-M, constituted the carrier control while the individual amplicons were the affected and normal controls. The controls were used as references for in-gel comparison to screen a population of 307 cattle and 259 water buffaloes; the frequency of the mutant allele was found to be 0. No DNA size standards were required in this study. The simulated control DNA samples representing normal, carrier and affected cattle have the potential to help in large-scale screening of a cattle population for individuals that are carriers or affected by factor XI deficiency. [Abstract/Link to Full Text]

Clarizia AD, Bastos-Rodrigues L, Pena HB, Anacleto C, Rossi B, Soares FA, Lopes A, Rocha JC, Caballero O, Camargo A, Simpson AJ, Pena SD
Relationship of the methylenetetrahydrofolate reductase C677T polymorphism with microsatellite instability and promoter hypermethylation in sporadic colorectal cancer.
Genet Mol Res. 2006;5(2):315-22.
The methylenetetrahydrofolate reductase (MTHFR) C677T polymorphism is associated with the expression of a thermolabile enzyme with decreased activity that influences the pool of methyl-donor molecules. Several studies have reported an association between C677T polymorphism and susceptibility to colorectal cancer (CRC). Considering that methylation abnormalities appear to be important for the pathogenesis of CRC, we examined the correlation between the genotype of the MTHFR C677T polymorphism, hypermethylation of the promoter region of five relevant genes (DAPK, MGMT, hMLH1, p16(INK4a), and p14(ARF)), and microsatellite instability, in 106 patients with primary CRCs in Brazil. We did not find significant differences in the genotypic frequencies of the MTHFR C677T polymorphism when one or more loci were hypermethylated. However, we did find a significant excess of 677TT individuals among patients with CRC who had microsatellite instability. This strong association was independent of the methylation status of hMLH1 and of the biogeographical genomic ancestry of the patients. Although the mechanism responsible for the link between the C677T polymorphism and microsatellite instability was not apparent, this finding may provide a clue towards a better understanding of the pathogenesis of microsatellite instability in human colorectal cancer. [Abstract/Link to Full Text]

Dario C, Carnicella D, Dario M, Bufano G
Morphological evolution and heritability estimates for some biometric traits in the Murgese horse breed.
Genet Mol Res. 2006;5(2):309-14.
A data set concerning 1,816 subjects entered in the Italian Horse Registry from 1925 to 2002 was analyzed to investigate the morphological evolution of the Murgese horse and to obtain useful elements to enhance breeding practices. Three basic body measurements (height at withers, chest girth, and cannon bone circumference) were considered for each subject. Heritabilities were calculated for each parameter to infer the growth and development traits of this breed. Over the past 20 years the Murgese horse has undergone considerable changes, passing from a typical mesomorphic structure (height at withers: 156.30 and 151.04 cm; chest girth: 185.80 and 176.11 cm; cannon bone: 21.10 and 19.82 cm for males and females, respectively) to a mesodolichomorphic structure (height at withers: 160.31 and 156.44 cm; chest girth: 187.89 and 182.48 cm; cannon bone: 21.07 and 20.37 cm, for males and females, respectively). Due to these changes and to its characteristic strength and power, the Murgese, which was once used in agriculture and for meat production (at the end of its life), is now involved in sports, mainly in trekking and equestrian tourism. The heritability estimates for the three body measurements were found to be 0.24, 0.39 and 0.44. [Abstract/Link to Full Text]

de Melo RC, Lopes CE, Fernandes FA, da Silveira CH, Santoro MM, Carceroni RL, Meira W, Ara�jo Ade A
A contact map matching approach to protein structure similarity analysis.
Genet Mol Res. 2006;5(2):284-308.
We modeled the problem of identifying how close two proteins are structurally by measuring the dissimilarity of their contact maps. These contact maps are colored images, in which the chromatic information encodes the chemical nature of the contacts. We studied two conceptually distinct image-processing algorithms to measure the dissimilarity between these contact maps; one was a content-based image retrieval method, and the other was based on image registration. In experiments with contact maps constructed from the protein data bank, our approach was able to identify, with greater than 80% precision, instances of monomers of apolipoproteins, globins, plastocyanins, retinol binding proteins and thioredoxins, among the monomers of Protein Data Bank Select. The image registration approach was only slightly more accurate than the content-based image retrieval approach. [Abstract/Link to Full Text]

Nassar NM
Are genetically modified crops compatible with sustainable agriculture?
Genet Mol Res. 2006;5(1):91-2. [Abstract/Link to Full Text]

Quitzau JA, Meidanis J
A fully resolved consensus between fully resolved phylogenetic trees.
Genet Mol Res. 2006;5(1):269-83.
Nowadays, there are many phylogeny reconstruction methods, each with advantages and disadvantages. We explored the advantages of each method, putting together the common parts of trees constructed by several methods, by means of a consensus computation. A number of phylogenetic consensus methods are already known. Unfortunately, there is also a taboo concerning consensus methods, because most biologists see them mainly as comparators and not as phylogenetic tree constructors. We challenged this taboo by defining a consensus method that builds a fully resolved phylogenetic tree based on the most common parts of fully resolved trees in a given collection. We also generated results showing that this consensus is in a way a kind of "median" of the input trees; as such it can be closer to the correct tree in many situations. [Abstract/Link to Full Text]

Veiga DF, Vicente FF, Bastos G
Gene networks as a tool to understand transcriptional regulation.
Genet Mol Res. 2006;5(1):254-68.
Gene regulatory networks, or simply gene networks (GNs), have shown to be a promising approach that the bioinformatics community has been developing for studying regulatory mechanisms in biological systems. GNs are built from the genome-wide high-throughput gene expression data that are often available from DNA microarray experiments. Conceptually, GNs are (un)directed graphs, where the nodes correspond to the genes and a link between a pair of genes denotes a regulatory interaction that occurs at transcriptional level. In the present study, we had two objectives: 1) to develop a framework for GN reconstruction based on a Bayesian network model that captures direct interactions between genes through nonparametric regression with B-splines, and 2) to demonstrate the potential of GNs in the analysis of expression data of a real biological system, the yeast pheromone response pathway. Our framework also included a number of search schemes to learn the network. We present an intuitive notion of GN theory as well as the detailed mathematical foundations of the model. A comprehensive analysis of the consistency of the model when tested with biological data was done through the analysis of the GNs inferred for the yeast pheromone pathway. Our results agree fairly well with what was expected based on the literature, and we developed some hypotheses about this system. Using this analysis, we intended to provide a guide on how GNs can be effectively used to study transcriptional regulation. We also discussed the limitations of GNs and the future direction of network analysis for genomic data. The software is available upon request. [Abstract/Link to Full Text]

Mudado Mde A, Ortega JM
A picture of gene sampling/expression in model organisms using ESTs and KOG proteins.
Genet Mol Res. 2006;5(1):242-53.
The expressed sequence tag (EST) is an instrument of gene discovery. When available in large numbers, ESTs may be used to estimate gene expression. We analyzed gene expression by EST sampling, using the KOG database, which includes 24,154 proteins from Arabidopsis thaliana (Ath), 17,101 from Caenorhabditis elegans (Cel), 10,517 from Drosophila melanogaster (Dme), and 26,324 from Homo sapiens (Hsa), and 178,538 ESTs for Ath, 215,200 for Cel, 261,404 for Dme, and 1,941,556 for Hsa. BLAST similarity searches were performed to assign KOG annotation to all ESTs. We determined the amount of gene sampling or expression dedicated to each KOG functional category by each model organism. We found that the 25% most-expressed genes are frequently shared among these organisms. The KOG protein classification allowed the EST sampling calculation throughout the glycolysis pathway. We calculated the KOG cluster coverage and inferred that 50 to 80 K ESTs would efficiently cover 80-85% of the KOG database clusters in a transcriptome project. Since KOG is a database biased towards housekeeping genes, this is probably the number of ESTs needed to include the more commonly expressed genes in these organisms. We also examined a still unaddressed question: what is the minimum number of ESTs that should be produced in a transcriptome project? [Abstract/Link to Full Text]

Schrago CG
An empirical examination of the standard errors of maximum likelihood phylogenetic parameters under the molecular clock via bootstrapping.
Genet Mol Res. 2006;5(1):233-41.
The molecular clock theory has greatly enlightened our understanding of macroevolutionary events. Maximum likelihood (ML) estimation of divergence times involves the adoption of fixed calibration points, and the confidence intervals associated with the estimates are generally very narrow. The credibility intervals are inferred assuming that the estimates are normally distributed, which may not be the case. Moreover, calculation of standard errors is usually carried out by the curvature method and is complicated by the difficulty in approximating second derivatives of the likelihood function. In this study, a standard primate phylogeny was used to examine the standard errors of ML estimates via the bootstrap method. Confidence intervals were also assessed from the posterior distribution of divergence times inferred via Bayesian Markov Chain Monte Carlo. For the primate topology under evaluation, no significant differences were found between the bootstrap and the curvature methods. Also, Bayesian confidence intervals were always wider than those obtained by ML. [Abstract/Link to Full Text]

Saha S, Heber S
In silico prediction of yeast deletion phenotypes.
Genet Mol Res. 2006;5(1):224-32.
Analysis of gene deletions is a fundamental approach for investigating gene function. We evaluated an algorithm that uses classification techniques to predict the phenotypic effects of gene deletions in yeast. We used a modified simulated annealing algorithm for feature selection and weighting. The selected features with high weights were phylogenetic conservation scores for bacteria, fungi (excluding Ascomycota), Ascomycota (excluding Saccharomyces cerevisiae), plants, and mammals, degree of paralogy, and number of protein-protein interactions. Classification was performed by weighted k-nearest neighbor and with support vector machine algorithms. To demonstrate how this approach might complement existing experimental procedures, we applied our algorithm to predict essential genes and genes causing morphological alterations in yeast. [Abstract/Link to Full Text]

Bezerra WM, Carvalho CP, Moreira Rde A, Grangeiro TB
Establishment of a heterologous system for the expression of Canavalia brasiliensis lectin: a model for the study of protein splicing.
Genet Mol Res. 2006;5(1):216-23.
During its biosynthesis in developing Canavalia brasiliensis seeds, the lectin ConBr undergoes a form of protein splicing in which the order of the N- and C-domains of the protein is reversed. To investigate whether these events can occur in other eukaryotic organisms, an expression system based on Pichia pastoris cells was established. A DNA fragment encoding prepro-ConBr was cloned into the vector pPICZB, and the recombinant plasmid was transformed in P. pastoris strain GS115. Ten clones were screened for effective recombinant protein production. Based on Western blot analysis of the two clones with the highest level of protein expression: 1) diffuse high-molecular mass immunoreactive bands were produced as early as 24 h after induction; 2) a single-, high-molecular mass protein was secreted into the medium, and 3) a significant fraction of the recombinant polypeptides that cross-reacted with anti-ConBr antibodies comprised a band of approximately 34.5 kDa. Diffuse protein bands with high molecular masses are attributed to hyperglycosylation at the single potential N-glycosylation site located in the linker peptide of prepro-ConBr. In contrast, native ConBr is made up of three polypeptides, the intact alpha chain (aa 1-237) and the fragments beta (aa 1-118) and gamma (aa 119-237), which have apparent molecular masses of 30, 16 and 12 kDa, respectively. Apparently, the yeast P. pastoris is not able to carry out all the complex post-translational proteolytic processing necessary for the biosynthesis of ConBr. [Abstract/Link to Full Text]

Ara�jo LV, Soares MA, Oliveira SM, Chequer P, Tanuri A, Sabino EC, Ferreira JE
DBCollHIV: a database system for collaborative HIV analysis in Brazil.
Genet Mol Res. 2006;5(1):203-15.
We developed a database system for collaborative HIV analysis (DBCollHIV) in Brazil. The main purpose of our DBCollHIV project was to develop an HIV-integrated database system with analytical bioinformatics tools that would support the needs of Brazilian research groups for data storage and sequence analysis. Whenever authorized by the principal investigator, this system also allows the integration of data from different studies and/or the release of the data to the general public. The development of a database that combines sequences associated with clinical/epidemiological data is difficult without the active support of interdisciplinary investigators. A functional database that securely stores data and helps the investigator to manipulate their sequences before publication would be an attractive tool for investigators depositing their data and collaborating with other groups. DBCollHIV allows investigators to manipulate their own datasets, as well as integrating molecular and clinical HIV data, in an innovative fashion. [Abstract/Link to Full Text]

Borro LC, Oliveira SR, Yamagishi ME, Mancini AL, Jardine JG, Mazoni I, Santos EH, Higa RH, Kuser PR, Neshich G
Predicting enzyme class from protein structure using Bayesian classification.
Genet Mol Res. 2006;5(1):193-202.
Predicting enzyme class from protein structure parameters is a challenging problem in protein analysis. We developed a method to predict enzyme class that combines the strengths of statistical and data-mining methods. This method has a strong mathematical foundation and is simple to implement, achieving an accuracy of 45%. A comparison with the methods found in the literature designed to predict enzyme class showed that our method outperforms the existing methods. [Abstract/Link to Full Text]

Silva JP, Lemke N, Mombach JC, Souza JG, Sinigaglia M, Vieira R
Exploring molecular networks using MONET ontology.
Genet Mol Res. 2006;5(1):182-92.
The description of the complex molecular network responsible for cell behavior requires new tools to integrate large quantities of experimental data in the design of biological information systems. These tools could be used in the characterization of these networks and in the formulation of relevant biological hypotheses. The building of an ontology is a crucial step because it integrates in a coherent framework the concepts necessary to accomplish such a task. We present MONET (molecular network), an extensible ontology and an architecture designed to facilitate the integration of data originating from different public databases in a single- and well-documented relational database, that is compatible with MONET formal definition. We also present an example of an application that can easily be implemented using these tools. [Abstract/Link to Full Text]

Baudet C, Dias Z
Analysis of slipped sequences in EST projects.
Genet Mol Res. 2006;5(1):169-81.
Slippage is an important sequencing problem that can occur in EST projects. However, very few studies have addressed this. We propose three new methods to detect slippage artifacts: arithmetic mean method, geometric mean method, and echo coverage method. Each method is simple and has two different strategies for processing sequences: suffix and subsequence. Using the 291,689 EST sequences produced in the SUCEST project, we performed comparative tests between our proposed methods and the SUCEST method. The subsequence strategy is better than the suffix strategy, because it is not anchored at the end of the sequence, so it is more flexible to find slippage at the beginning of the EST. In a comparison with the SUCEST method, the advantage of our methods is that they do not discard the majority of the sequences marked as slippage, but instead only remove the slipped artifact from the sequence. Based on our tests the echo coverage method with subsequence strategy shows the best compromise between slippage detection and ease of calibration. [Abstract/Link to Full Text]

Cristino AS, Nascimento AM, Costa Lda F, Sim�es ZL
A comparative analysis of highly conserved sex-determining genes between Apis mellifera and Drosophila melanogaster.
Genet Mol Res. 2006;5(1):154-68.
A comparison of the most conserved sex-determining genes between the fruit fly, Drosophila melanogaster, and the honey bee, Apis mellifera, was performed with bioinformatics tools developed for computational molecular biology. An initial set of protein sequences already described in the fruit fly as participants of the sex-determining cascade was retrieved from the Gene Ontology database (http://www.geneontology.org/) and aligned against a database of protein sequences predicted from the honey bee genome. The doublesex (dsx) gene is considered one of the most conserved sex-determining genes among metazoans, and a male-specific partial cDNA of putative A. mellifera dsx gene (Amdsx) was identified experimentally. The theoretical predictions were developed in the context of sequence similarity. Experimental evidence indicates that dsx is present in embryos and larvae, and that it encodes a transcription factor widely conserved in metazoans, containing a DM DNA-binding domain implicated in the regulation of the expression of genes involved in sexual phenotype formation. [Abstract/Link to Full Text]

Galves M, Quitzau JA, Dias Z
New strategy to detect single nucleotide polymorphisms.
Genet Mol Res. 2006;5(1):143-53.
A great effort has been made to identify and map a large set of single nucleotide polymorphisms. The goal is to determine human DNA variants that contribute most significantly to population variation in each trait. Different algorithms and software packages, such as PolyBayes and PolyPhred, have been developed to address this problem. We present strategies to detect single nucleotide polymorphisms, using chromatogram analysis and consensi of multiple aligned sequences. The algorithms were tested using HIV datasets, and the results were compared with those produced by PolyBayes and PolyPhred using the same dataset. Our algorithms produced significantly better results than these two software packages. [Abstract/Link to Full Text]

V�ncio RZ, Patr�o DF, Baptista CS, Pereira CA, Zingales B
BayBoots: a model-free Bayesian tool to identify class markers from gene expression data.
Genet Mol Res. 2006;5(1):138-42.
One of the goals of gene expression experiments is the identification of differentially expressed genes among populations that could be used as markers. For this purpose, we implemented a model-free Bayesian approach in a user-friendly and freely available web-based tool called BayBoots. In spite of a common misunderstanding that Bayesian and model-free approaches are incompatible, we merged them in the BayBoots implementation using the Kernel density estimator and Rubin 's Bayesian Bootstrap. We used the Bayes error rate (BER) instead of the usual P values as an alternative statistical index to rank a class marker's discriminative potential, since it can be visualized by a simple graphical representation and has an intuitive interpretation. Subsequently, Bayesian Bootstrap was used to assess BER 's credibility. We tested BayBoots on microarray data to look for markers for Trypanosoma cruzi strains isolated from cardiac and asymptomatic patients. We found that the three most frequently used methods in microarray analysis: t-test, non-parametric Wilcoxon test and correlation methods, yielded several markers that were discarded by a time-consuming visual check. On the other hand, the BayBoots graphical output and ranking was able to automatically identify markers for which classification performance was consistent. BayBoots is available at: http://www.vision.ime.usp.br/~rvencio/BayBoots. [Abstract/Link to Full Text]

Higa RH, Cruz SA, Kuser PR, Yamagishi ME, Fileto R, Oliveira SR, Mazoni I, Santos EH, Mancini AL, Neshich G
Building multiple sequence alignments with a flavor of HSSP alignments.
Genet Mol Res. 2006;5(1):127-37.
Homology-derived secondary structure of proteins (HSSP) is a well-known database of multiple sequence alignments (MSAs) which merges information of protein sequences and their three-dimensional structures. It is available for all proteins whose structure is deposited in the PDB. It is also used by STING and (Java)Protein Dossier to calculate and present relative entropy as a measure of the degree of conservation for each residue of proteins whose structure has been solved and deposited in the PDB. However, if the STING and (Java)Protein Dossier are to provide support for analysis of protein structures modeled in computers or being experimentally solved but not yet deposited in the PDB, then we need a new method for building alignments having a flavor of HSSP alignments (myMSAr). The present study describes a new method and its corresponding databank (SH2QS--database of sequences homologue to the query [structure-having] sequence). Our main interest in making myMSAr was to measure the degree of residue conservation for a given query sequence, regardless of whether it has a corresponding structure deposited in the PDB. In this study, we compare the measurement of residue conservation provided by corresponding alignments produced by HSSP and SH2QS. As a case study, we also present two biologically relevant examples, the first one highlighting the equivalence of analysis of the degree of residue conservation by using HSSP or SH2QS alignments, and the second one presenting the degree of residue conservation for a structure modeled in a computer, which , as a consequence, does not have an alignment reported by HSSP. [Abstract/Link to Full Text]

Catanho M, Mascarenhas D, Degrave W, Miranda AB
GenoMycDB: a database for comparative analysis of mycobacterial genes and genomes.
Genet Mol Res. 2006;5(1):115-26.
Several databases and computational tools have been created with the aim of organizing, integrating and analyzing the wealth of information generated by large-scale sequencing projects of mycobacterial genomes and those of other organisms. However, with very few exceptions, these databases and tools do not allow for massive and/or dynamic comparison of these data. GenoMycDB (http://www.dbbm.fiocruz.br/GenoMycDB) is a relational database built for large-scale comparative analyses of completely sequenced mycobacterial genomes, based on their predicted protein content. Its central structure is composed of the results obtained after pair-wise sequence alignments among all the predicted proteins coded by the genomes of six mycobacteria: Mycobacterium tuberculosis (strains H37Rv and CDC1551), M. bovis AF2122/97, M. avium subsp. paratuberculosis K10, M. leprae TN, and M. smegmatis MC2 155. The database stores the computed similarity parameters of every aligned pair, providing for each protein sequence the predicted subcellular localization, the assigned cluster of orthologous groups, the features of the corresponding gene, and links to several important databases. Tables containing pairs or groups of potential homologs between selected species/strains can be produced dynamically by user-defined criteria, based on one or multiple sequence similarity parameters. In addition, searches can be restricted according to the predicted subcellular localization of the protein, the DNA strand of the corresponding gene and/or the description of the protein. Massive data search and/or retrieval are available, and different ways of exporting the result are offered. GenoMycDB provides an on-line resource for the functional classification of mycobacterial proteins as well as for the analysis of genome structure, organization, and evolution. [Abstract/Link to Full Text]

Pereira GS, Brand�o RM, Giuliatti S, Zago MA, Silva WA
Gene Class expression: analysis tool of Gene Ontology terms with gene expression data.
Genet Mol Res. 2006;5(1):108-14.
Serial analysis of gene expression (SAGE) technology produces large sets of interesting genes that are difficult to analyze directly. Bioinformatics tools are needed to interpret the functional information in these gene sets. We present an interactive web-based tool, called Gene Class, which allows functional annotation of SAGE data using the Gene Ontology (GO) database. This tool performs searches in the GO database for each SAGE tag, making associations in the selected GO category for a level selected in the hierarchy. This system provides user-friendly data navigation and visualization for mapping SAGE data onto the gene ontology structure. This tool also provides graphical visualization of the percentage of SAGE tags in each GO category, along with confidence intervals and hypothesis testing. [Abstract/Link to Full Text]

Koide T, Salem-Izacc SM, Gomes SL, V�ncio RZ
SpotWhatR: a user-friendly microarray data analysis system.
Genet Mol Res. 2006;5(1):93-107.
SpotWhatR is a user-friendly microarray data analysis tool that runs under a widely and freely available R statistical language (http://www.r-project.org) for Windows and Linux operational systems. The aim of SpotWhatR is to help the researcher to analyze microarray data by providing basic tools for data visualization, normalization, determination of differentially expressed genes, summarization by Gene Ontology terms, and clustering analysis. SpotWhatR allows researchers who are not familiar with computational programming to choose the most suitable analysis for their microarray dataset. Along with well-known procedures used in microarray data analysis, we have introduced a stand-alone implementation of the HTself method, especially designed to find differentially expressed genes in low-replication contexts. This approach is more compatible with our local reality than the usual statistical methods. We provide several examples derived from the Blastocladiella emersonii and Xylella fastidiosa Microarray Projects. SpotWhatR is freely available at http://blasto.iq.usp.br/~tkoide/SpotWhatR, in English and Portuguese versions. In addition, the user can choose between "single experiment" and "batch processing" versions. [Abstract/Link to Full Text]

Teixeira DI, Melo LM, Gadelha CA, Cunha RM, Bloch C, R�dis-Baptista G, Cavada BS, Freitas VJ
Ion-exchange chromatography used to isolate a spermadhesin-related protein from domestic goat (Capra hircus) seminal plasma.
Genet Mol Res. 2006;5(1):79-87.
Mammalian seminal plasma contains among others, proteins called spermadhesins, which are the major proteins of boar and stallion seminal plasma. These proteins appear to be involved in capacitation and sperm-egg interaction. Previously, we reported the presence of a protein related to spermadhesins in goat seminal plasma. In the present study, we have further characterized this protein, and we propose ion-exchange chromatography to isolate this seminal protein. Semen was obtained from four adult Saanen bucks. Seminal plasma was pooled, dialyzed against distilled water and freeze-dried. Lyophilized proteins were loaded onto an ion-exchange chromatography column. Dialyzed-lyophilized proteins from the main peak of DEAE-Sephacel were applied to a C2/C18 column coupled to an RP-HPLC system, and the eluted proteins were lyophilized for electrophoresis. The N-terminal was sequenced and amino acid sequence similarity was determined using CLUSTAL W. Additionally, proteins from DEAE-Sephacel chromatography step were dialyzed and submitted to a heparin-Sepharose high-performance liquid chromatography. Goat seminal plasma after ion-exchange chromatography yielded 6.47 +/- 0.63 mg (mean +/- SEM) of the major retained fraction. The protein was designated BSFP (buck seminal fluid protein). BSFP exhibited N-terminal sequence homology to boar, stallion and bull spermadhesins. BSFP showed no heparin-binding capabilities. These results together with our previous data indicate that goat seminal plasma contains a protein that is structurally related to proteins of the spermadhesin family. Finally, this protein can be efficiently isolated by ion-exchange and reverse-phase chromatography. [Abstract/Link to Full Text]

Recent Articles in BMC Medical Genetics

Maranda B, Lemieux N, Lemyre E
Familial deletion 18p syndrome: case report.
BMC Med Genet. 2006;760.
BACKGROUND: Deletion 18p is a frequent deletion syndrome characterized by dysmorphic features, growth deficiencies, and mental retardation with a poorer verbal performance. Until now, five families have been described with limited clinical description. We report transmission of deletion 18p from a mother to her two daughters and review the previous cases. CASE PRESENTATION: The proband is 12 years old and has short stature, dysmorphic features and moderate mental retardation. Her sister is 9 years old and also has short stature and similar dysmorphic features. Her cognitive performance is within the borderline to mild mental retardation range. The mother also presents short stature. Psychological evaluation showed moderate mental retardation. Chromosome analysis from the sisters and their mother revealed the same chromosomal deletion: 46, XX, del(18)(p11.2). Previous familial cases were consistent regarding the transmission of mental retardation. Our family differs in this regard with variable cognitive impairment and does not display poorer verbal than non-verbal abilities. An exclusive maternal transmission is observed throughout those families. Women with del(18p) are fertile and seem to have a normal miscarriage rate. CONCLUSION: Genetic counseling for these patients should take into account a greater range of cognitive outcome than previously reported. [Abstract/Link to Full Text]

Maciolek NL, Alward WL, Murray JC, Semina EV, McNally MT
Analysis of RNA splicing defects in PITX2 mutants supports a gene dosage model of Axenfeld-Rieger syndrome.
BMC Med Genet. 2006;759.
BACKGROUND: Axenfeld-Rieger syndrome (ARS) is associated with mutations in the PITX2 gene that encodes a homeobox transcription factor. Several intronic PITX2 mutations have been reported in Axenfeld-Rieger patients but their effects on gene expression have not been tested. METHODS: We present two new families with recurrent PITX2 intronic mutations and use PITX2c minigenes and transfected cells to address the hypothesis that intronic mutations effect RNA splicing. Three PITX2 mutations have been analyzed: a G>T mutation within the AG 3' splice site (ss) junction associated with exon 4 (IVS4-1G>T), a G>C mutation at position +5 of the 5' (ss) of exon 4 (IVS4+5G>C), and a previously reported A>G substitution at position -11 of 3'ss of exon 5 (IVS5-11A>G). RESULTS: Mutation IVS4+5G>C showed 71% retention of the intron between exons 4 and 5, and poorly expressed protein. Wild-type protein levels were proportionally expressed from correctly spliced mRNA. The G>T mutation within the exon 4 AG 3'ss junction shifted splicing exclusively to a new AG and resulted in a severely truncated, poorly expressed protein. Finally, the A>G substitution at position -11 of the 3'ss of exon 5 shifted splicing exclusively to a newly created upstream AG and resulted in generation of a protein with a truncated homeodomain. CONCLUSION: This is the first direct evidence to support aberrant RNA splicing as the mechanism underlying the disorder in some patients and suggests that the magnitude of the splicing defect may contribute to the variability of ARS phenotypes, in support of a gene dosage model of Axenfeld-Rieger syndrome. [Abstract/Link to Full Text]

Borlak J, Reamon-Buettner SM
N-acetyltransferase 2 (NAT2) gene polymorphisms in colon and lung cancer patients.
BMC Med Genet. 2006;758.
BACKGROUND: N-acetyltransferase 2 (NAT2) metabolizes arylamines and hydrazines moeities found in many therapeutic drugs, chemicals and carcinogens. The gene encoding NAT2 is polymorphic, thus resulting in rapid or slow acetylator phenotypes. The acetylator status may, therefore, predispose drug-induced toxicities and cancer risks, such as bladder, colon and lung cancer. Indeed, some studies demonstrate a positive association between NAT2 rapid acetylator phenotype and colon cancer, but results are inconsistent. The role of NAT2 acetylation status in lung cancer is likewise unclear, in which both the rapid and slow acetylator genotypes have been associated with disease. METHODS: We investigated three genetic variations, c.481C>T, c.590G>A (p.R197Q) and c.857G>A (p.G286E), of the NAT2 gene, which are known to result in a slow acetylator phenotype. Using validated PCR-RFLP assays, we genotyped 243 healthy unrelated Caucasian control subjects, 92 colon and 67 lung cancer patients for these genetic variations. As there is a recent meta-analysis of NAT2 studies on colon cancer (unlike in lung cancer), we have also undertaken a systematic review of NAT2 studies on lung cancer, and we incorporated our results in a meta-analysis consisting of 16 studies, 3,865 lung cancer patients and 6,077 control subjects. RESULTS: We did not obtain statistically significant differences in NAT2 allele and genotype frequencies in colon cancer patients and control group. Certain genotypes, however, such as [c.590AA+c.857GA] and [c.590GA+c.857GA] were absent among the colon cancer patients. Similarly, allele frequencies in lung cancer patients and controls did not differ significantly. Nevertheless, there was a significant increase of genotypes [c.590GA] and [c.481CT+c.590GA], but absence of homozygous c.590AA and [c.590AA+c.857GA] in the lung cancer group. Meta-analysis of 16 NAT2 studies on lung cancer did not evidence an overall association of the rapid or slow acetylator status to lung cancer. Similarly, the summary odds ratios obtained with stratified meta-analysis based on ethnicity, and smoking status were not significant. CONCLUSION: Our study failed to show an overall association of NAT2 genotypes to either colon or lung cancer risk. [Abstract/Link to Full Text]

Wang CY, Nguyen ND, Morrison NA, Eisman JA, Center JR, Nguyen TV
Beta3-adrenergic receptor gene, body mass index, bone mineral density and fracture risk in elderly men and women: the Dubbo Osteoporosis Epidemiology Study (DOES).
BMC Med Genet. 2006;757.
BACKGROUND: Recent studies have suggested that the Arg allele of beta3-adrenergic receptor (ADRB3) gene is associated with body mass index (BMI), which is an important predictor of bone mineral density (BMD) and fracture risk. However, whether the ADRB3 gene polymorphism is associated with fracture risk has not been investigated. The aim of study was to examine the inter-relationships between ADRB3 gene polymorphisms, BMI, BMD and fracture risk in elderly Caucasians. METHODS: Genotypes of the ADRB3 gene were determined in 265 men and 446 women aged 60+ in 1989 at entry into the study, whose BMD were measured by DXA (GE Lunar, WI USA) at baseline. During the follow-up period (between 1989 and 2004), fractures were ascertained by reviewing radiography reports and personal interviews. RESULTS: The allelic frequencies of the Trp and the Arg alleles were 0.925 and 0.075 respectively, and the relative frequencies of genotypes Trp/Trp, Trp/Arg and Arg/Arg 0.857, 0.138 and 0.006 respectively. There was no significant association between BMI and ADRB3 genotypes (p = 0.10 in women and p = 0.68 in men). There was also no significant association between ADRB3 genotypes and lumbar spine or femoral neck BMD in either men and women. Furthermore, there were no significant association between ADRB3 genotypes and fracture risk in both women and men, either before or after adjusting for and, BMD and BMI. CONCLUSION: The present data suggested that in Caucasian population the contribution of ADRB3 genotypes to the prediction of BMI, BMD and fracture risk is limited. [Abstract/Link to Full Text]

Ortiz J, Fern�ndez-Arquero M, Urcelay E, L�pez-Mej�as R, Ferreira A, Font�n G, de la Concha EG, Mart�nez A
Interleukin-10 polymorphisms in Spanish IgA deficiency patients: a case-control and family study.
BMC Med Genet. 2006;756.
BACKGROUND: IgA deficiency (IgAD) is the most common primary immunodeficiency in Caucasians. Genetic and environmental factors are suspected to be involved in the development of the disease. Interleukin-10 (IL-10) is a cytokine with stimulatory activity on immunoglobulin production and it may be an important regulator in IgAD pathogenesis. The IL-10 gene contains several single nucleotide polymorphisms (SNPs) and two polymorphic microsatellites located in the 5'-flanking region. Our aim was to ascertain if any of these polymorphic markers are associated or linked to IgAD in Spanish patients. METHODS: We genotyped 278 patients with IgAD and 573 ethnically matched controls for the microsatellites IL-10R and IL-10G and for three single nucleotide polymorphisms at positions -1082, -819 and -592 in the proximal promoter of the gene. We also included in this study the parents of 194 patients in order to study the IL-10 haplotypes transmitted and not transmitted to the affected offspring. RESULTS: The only allele where a significant difference was observed in the comparison between IgA deficiency patients and controls was the IL-10G12 allele (OR = 1.58 and p = 0.021). However, this p value could not withstand a Bonferroni correction. None of the IL-10R or promoter SNP alleles was found at a different frequency when patients were compared with controls. CONCLUSION: Our data do not show any significant difference in IL-10 polymorphism frequencies between control and IgAD patient samples. Their haplotype distribution among patients and controls was also equivalent and therefore these microsatellites and SNPs do not seem to influence IgAD susceptibility. [Abstract/Link to Full Text]

Nissen PH, Damgaard D, Stenderup A, Nielsen GG, Larsen ML, Faergeman O
Genomic characterization of five deletions in the LDL receptor gene in Danish Familial Hypercholesterolemic subjects.
BMC Med Genet. 2006;755.
BACKGROUND: Familial Hypercholesterolemia is a common autosomal dominantly inherited disease that is most frequently caused by mutations in the gene encoding the receptor for low density lipoproteins (LDLR). Deletions and other major structural rearrangements of the LDLR gene account for approximately 5% of the mutations in many populations. METHODS: Five genomic deletions in the LDLR gene were characterized by amplification of mutated alleles and sequencing to identify genomic breakpoints. A diagnostic assay based on duplex PCR for the exon 7-8 deletion was developed to discriminate between heterozygotes and normals, and bioinformatic analyses were used to identify interspersed repeats flanking the deletions. RESULTS: In one case 15 bp had been inserted at the site of the deleted DNA, and, in all five cases, Alu elements flanked the sites where deletions had occurred. An assay developed to discriminate the wildtype and the deletion allele in a simple duplex PCR detected three FH patients as heterozygotes, and two individuals with normal lipid values were detected as normal homozygotes. CONCLUSION: The identification of the breakpoints should make it possible to develop specific tests for these mutations, and the data provide further evidence for the role of Alu repeats in intragenic deletions. [Abstract/Link to Full Text]

Santiago JL, Mart�nez A, de la Calle H, Fern�ndez-Arquero M, Figueredo MA, de la Concha EG, Urcelay E
Evidence for the association of the SLC22A4 and SLC22A5 genes with type 1 diabetes: a case control study.
BMC Med Genet. 2006;754.
BACKGROUND: Type 1 diabetes (T1D) is a chronic, autoimmune and multifactorial disease characterized by abnormal metabolism of carbohydrate and fat. Diminished carnitine plasma levels have been previously reported in T1D patients and carnitine increases the sensitivity of the cells to insulin. Polymorphisms in the carnitine transporters, encoded by the SLC22A4 and SLC22A5 genes, have been involved in susceptibility to two other autoimmune diseases, rheumatoid arthritis and Crohn's disease. For these reasons, we investigated for the first time the association with T1D of six single nucleotide polymorphisms (SNPs) mapping to these candidate genes: slc2F2, slc2F11, T306I, L503F, OCTN2-promoter and OCTN2-intron. METHODS: A case-control study was performed in the Spanish population with 295 T1D patients and 508 healthy control subjects. Maximum-likelihood haplotype frequencies were estimated by applying the Expectation-Maximization (EM) algorithm implemented by the Arlequin software. RESULTS: When independently analyzed, one of the tested polymorphisms in the SLC22A4 gene at 1672 showed significant association with T1D in our Spanish cohort. The overall comparison of the inferred haplotypes was significantly different between patients and controls (chi2 = 10.43; p = 0.034) with one of the haplotypes showing a protective effect for T1D (rs3792876/rs1050152/rs2631367/rs274559, CCGA: OR = 0.62 (0.41-0.93); p = 0.02). CONCLUSION: The haplotype distribution in the carnitine transporter locus seems to be significantly different between T1D patients and controls; however, additional studies in independent populations would allow to confirm the role of these genes in T1D risk. [Abstract/Link to Full Text]

Engelfried K, Vorgerd M, Hagedorn M, Haas G, Gilles J, Epplen JT, Meins M
Charcot-Marie-Tooth neuropathy type 2A: novel mutations in the mitofusin 2 gene (MFN2).
BMC Med Genet. 2006;753.
BACKGROUND: Charcot-Marie-Tooth neuropathies are a group of genetically heterogeneous diseases of the peripheral nervous system. Mutations in the MFN2 gene have been reported as the primary cause of Charcot-Marie-Tooth disease type 2A. METHODS: Patients with the clinical diagnosis of Charcot-Marie-Tooth type 2 were screened using single strand conformation polymorphism (SSCP). All DNA samples showing band shifts in the SSCP analysis were amplified from genomic DNA and cycle sequenced. RESULTS: We analyzed a total of 73 unrelated patients with a clinical diagnosis of CMT 2. Overall, novel mutations were detected in 6 patients. c.380G>T (G127V), c.1128G>A (M376I), c.1040A>T (E347V), c.1403G>A (R468H), c.2113G>A (V705I), and c.2258_2259insT (L753fs). CONCLUSION: We confirmed a significant role of mutations in MFN2 in the pathogenesis of Charcot-Marie-Tooth disease type 2. [Abstract/Link to Full Text]

Abu-Amero KK, Al-Boudari OM, Mohamed GH, Dzimiri N
E-selectin S128R polymorphism and severe coronary artery disease in Arabs.
BMC Med Genet. 2006;752.
BACKGROUND: The E-selectin p. S128R (g. A561C) polymorphism has been associated with the presence of angiographic coronary artery disease (CAD) in some populations, but no data is currently available on its association with CAD in Arabs. METHODS: In the present study, we determined the potential relevance of the E-selectin S128R polymorphism for severe CAD and its associated risk factors among Arabs. We genotyped Saudi Arabs for this polymorphism by PCR, followed by restriction enzyme digestion. RESULTS: The polymorphism was determined in 556 angiographically confirmed severe CAD patients and 237 control subjects with no CAD as established angiographically (CON). Frequencies of the S/S, S/R and R/R genotypes were found as 81.1%, 16.6% and 2.3% in CAD patients and 87.8%, 11.8%, and 0.4% in CON subjects, respectively. The frequency of the mutant 128R allele was higher among CAD patients compared to CON group (11% vs. 6%; odds ratio = 1.76; 95% CI 1.14 - 2.72; p = .007), thus indicating a significant association of the 128R allele with CAD among our population. However, the stepwise logistic regression for the 128R allele and different CAD risk factors showed no significant association. CONCLUSION: Among the Saudi population, The E-selectin p. S128R (g. A561C) polymorphism was associated with angiographic CAD in Univariate analysis, but lost its association in multivariate analysis. [Abstract/Link to Full Text]

Freathy RM, Weedon MN, Melzer D, Shields B, Hitman GA, Walker M, McCarthy MI, Hattersley AT, Frayling TM
The functional "KL-VS" variant of KLOTHO is not associated with type 2 diabetes in 5028 UK Caucasians.
BMC Med Genet. 2006;751.
BACKGROUND: Klotho has an important role in insulin signalling and the development of ageing-like phenotypes in mice. The common functional "KL-VS" variant in the KLOTHO (KL) gene is associated with longevity in humans but its role in type 2 diabetes is not known. We performed a large case-control and family-based study to test the hypothesis that KL-VS is associated with type 2 diabetes in a UK Caucasian population. METHODS: We genotyped 1793 cases, 1619 controls and 1616 subjects from 509 families for the single nucleotide polymorphism (SNP) F352V (rs9536314) that defines the KL-VS variant. Allele and genotype frequencies were compared between cases and controls. Family-based analysis was used to test for over- or under-transmission of V352 to affected offspring. RESULTS: Despite good power to detect odds ratios of 1.2, there were no significant associations between alleles or genotypes and type 2 diabetes (V352 allele: odds ratio = 0.96 (0.84-1.09)). Additional analysis of quantitative trait data in 1177 healthy control subjects showed no association of the variant with fasting insulin, glucose, triglycerides, HDL- or LDL-cholesterol (all P > 0.05). However, the HDL-cholesterol levels observed across the genotype groups showed a similar, but non-significant, pattern to previously reported data. CONCLUSION: This is the first large-scale study to examine the association between common functional variation in KL and type 2 diabetes risk. We have found no evidence that the functional KL-VS variant is a risk factor for type 2 diabetes in a large UK Caucasian case-control and family-based study. [Abstract/Link to Full Text]

Marti A, Ochoa MC, S�nchez-Villegas A, Mart�nez JA, Mart�nez-Gonz�lez MA, Hebebrand J, Hinney A, Vedder H
Meta-analysis on the effect of the N363S polymorphism of the glucocorticoid receptor gene (GRL) on human obesity.
BMC Med Genet. 2006;750.
BACKGROUND: Since both excess glucocorticoid secretion and central obesity are clinical features of some obese patients, it is worthwhile to study a possible association of glucocorticoid receptor gene (GRL) variants with obesity. Previous studies have linked the N363S variant of the GRL gene to increased glucocorticoid effects such as higher body fat, a lower lean-body mass and a larger insulin response to dexamethasone. However, contradictory findings have been also reported about the association between this variant and obesity phenotypes. Individual studies may lack statistical power which may result in disparate results. This limitation can be overcome using meta-analytic techniques. METHODS: We conducted a meta-analysis to assess the association between the N363S polymorphism of the GRL gene and obesity risk. In addition to published research, we included also our own unpublished data -three novel case-control studies- in the meta-analysis The new case-control studies were conducted in German and Spanish children, adolescents and adults (total number of subjects: 1,117). Genotype was assessed by PCR-RFLP (Tsp509I). The final formal meta-analysis included a total number of 5,909 individuals. RESULTS: The meta-analysis revealed a higher body mass index (BMI) with an overall estimation of +0.18 kg/m2 (95% CI: +0.004 to +0.35) for homo-/heterozygous carriers of the 363S allele of the GRL gene in comparison to non-carriers. Moreover, differences in pooled BMI were statistically significant and positive when considering one-group studies from the literature in which participants had a BMI below 27 kg/m2 (+ 0.41 kg/m2 [95% CI +0.17 to +0.66]), but the differences in BMI were negative when only our novel data from younger (aged under 45) and normal weight subjects were pooled together (-0.50 kg/m2 [95% CI -0.84 to -0.17]). The overall risk for obesity for homo-/heterozygous carriers of the 363S allele was not statistically significant in the meta-analysis (pooled OR = 1.02; 95% CI: 0.56-1.87). CONCLUSION: Although certain genotypic effects could be population-specific, we conclude that there is no compelling evidence that the N363S polymorphism of the GRL gene is associated with either average BMI or obesity risk. [Abstract/Link to Full Text]

Chowdhury MA, Kuivaniemi H, Romero R, Edwin S, Chaiworapongsa T, Tromp G
Identification of novel functional sequence variants in the gene for peptidase inhibitor 3.
BMC Med Genet. 2006;749.
BACKGROUND: Peptidase inhibitor 3 (PI3) inhibits neutrophil elastase and proteinase-3, and has a potential role in skin and lung diseases as well as in cancer. Genome-wide expression profiling of chorioamniotic membranes revealed decreased expression of PI3 in women with preterm premature rupture of membranes. To elucidate the molecular mechanisms contributing to the decreased expression in amniotic membranes, the PI3 gene was searched for sequence variations and the functional significance of the identified promoter variants was studied. METHODS: Single nucleotide polymorphisms (SNPs) were identified by direct sequencing of PCR products spanning a region from 1,173 bp upstream to 1,266 bp downstream of the translation start site. Fourteen SNPs were genotyped from 112 and nine SNPs from 24 unrelated individuals. Putative transcription factor binding sites as detected by in silico search were verified by electrophoretic mobility shift assay (EMSA) using nuclear extract from Hela and amnion cell nuclear extract. Deviation from Hardy-Weinberg equilibrium (HWE) was tested by chi2 goodness-of-fit test. Haplotypes were estimated using expectation maximization (EM) algorithm. RESULTS: Twenty-three sequence variations were identified by direct sequencing of polymerase chain reaction (PCR) products covering 2,439 nt of the PI3 gene (-1,173 nt of promoter sequences and all three exons). Analysis of 112 unrelated individuals showed that 20 variants had minor allele frequencies (MAF) ranging from 0.02 to 0.46 representing "true polymorphisms", while three had MAF < or = 0.01. Eleven variants were in the promoter region; several putative transcription factor binding sites were found at these sites by database searches. Differential binding of transcription factors was demonstrated at two polymorphic sites by electrophoretic mobility shift assays, both in amniotic and HeLa cell nuclear extracts. Differential binding of the transcription factor GATA1 at -689C>G site was confirmed by a supershift. CONCLUSION: The promoter sequences of PI3 have a high degree of variability. Functional promoter variants provide a possible mechanism for explaining the differences in PI3 mRNA expression levels in the chorioamniotic membranes, and are also likely to be useful in elucidating the role of PI3 in other diseases. [Abstract/Link to Full Text]

S�nchez E, Sabio JM, Callejas JL, de Ram�n E, Garcia-Portales R, Garc�a-Hern�ndez FJ, Jim�nez-Alonso J, Gonz�lez-Escribano MF, Mart�n J, Koeleman BP
Association study of genetic variants of pro-inflammatory chemokine and cytokine genes in systemic lupus erythematosus.
BMC Med Genet. 2006;748.
BACKGROUND: Several lines of evidence suggest that chemokines and cytokines play an important role in the inflammatory development and progression of systemic lupus erythematosus. The aim of this study was to evaluate the relevance of functional genetic variations of RANTES, IL-8, IL-1alpha, and MCP-1 for systemic lupus erythematosus. METHODS: The study was conducted on 500 SLE patients and 481 ethnically matched healthy controls. Genotyping of polymorphisms in the RANTES, IL-8, IL-1alpha, and MCP-1 genes were performed using a real-time polymerase chain reaction (PCR) system with pre-developed TaqMan allelic discrimination assay. RESULTS: No significant differences between SLE patients and healthy controls were observed when comparing genotype, allele or haplotype frequencies of the RANTES, IL-8, IL-1alpha, and MCP-1 polymorphisms. In addition, no evidence for association with clinical sub-features of SLE was found. CONCLUSION: These results suggest that the tested functional variation of RANTES, IL-8, IL-1alpha, and MCP-1 genes do not confer a relevant role in the susceptibility or severity of SLE in the Spanish population. [Abstract/Link to Full Text]

Soler JM, Pereira AC, T�rres CH, Krieger JE
Gene by environment QTL mapping through multiple trait analyses in blood pressure salt-sensitivity: identification of a novel QTL in rat chromosome 5.
BMC Med Genet. 2006;747.
BACKGROUND: The genetic mechanisms underlying interindividual blood pressure variation reflect the complex interplay of both genetic and environmental variables. The current standard statistical methods for detecting genes involved in the regulation mechanisms of complex traits are based on univariate analysis. Few studies have focused on the search for and understanding of quantitative trait loci responsible for gene x environmental interactions or multiple trait analysis. Composite interval mapping has been extended to multiple traits and may be an interesting approach to such a problem. METHODS: We used multiple-trait analysis for quantitative trait locus mapping of loci having different effects on systolic blood pressure with NaCl exposure. Animals studied were 188 rats, the progenies of an F2 rat intercross between the hypertensive and normotensive strain, genotyped in 179 polymorphic markers across the rat genome. To accommodate the correlational structure from measurements taken in the same animals, we applied univariate and multivariate strategies for analyzing the data. RESULTS: We detected a new quantitative train locus on a region close to marker R589 in chromosome 5 of the rat genome, not previously identified through serial analysis of individual traits. In addition, we were able to justify analytically the parametric restrictions in terms of regression coefficients responsible for the gain in precision with the adopted analytical approach. CONCLUSION: Future work should focus on fine mapping and the identification of the causative variant responsible for this quantitative trait locus signal. The multivariable strategy might be valuable in the study of genetic determinants of interindividual variation of antihypertensive drug effectiveness. [Abstract/Link to Full Text]

Kimberley KW, Morris CA, Hobart HH
BAC-FISH refutes report of an 8p22-8p23.1 inversion or duplication in 8 patients with Kabuki syndrome.
BMC Med Genet. 2006;746.
BACKGROUND: Kabuki syndrome is a multiple congenital anomaly/mental retardation syndrome. The syndrome is characterized by varying degrees of mental retardation, postnatal growth retardation, distinct facial characteristics resembling the Kabuki actor's make-up, cleft or high-arched palate, brachydactyly, scoliosis, and persistence of finger pads. The multiple organ involvement suggests that this is a contiguous gene syndrome but no chromosomal anomalies have been isolated as an etiology. Recent studies have focused on possible duplications in the 8p22-8p23.1 region but no consensus has been reached. METHODS: We used bacterial artificial chromosome-fluorescent in-situ hybridization (BAC-FISH) and G-band analysis to study eight patients with Kabuki syndrome. RESULTS: Metaphase analysis revealed no deletions or duplications with any of the BAC probes. Interphase studies of the Kabuki patients yielded no evidence of inversions when using three-color FISH across the region. These results agree with other research groups' findings but disagree with the findings of Milunsky and Huang. CONCLUSION: It seems likely that Kabuki syndrome is not a contiguous gene syndrome of the 8p region studied. [Abstract/Link to Full Text]

Vidal-Taboada JM, Cucala M, Mas Herrero S, Lafuente A, Cobos A
Satisfaction survey with DNA cards method to collect genetic samples for pharmacogenetics studies.
BMC Med Genet. 2006;745.
BACKGROUND: Pharmacogenetic studies are essential in understanding the interindividual variability of drug responses. DNA sample collection for genotyping is a critical step in genetic studies. A method using dried blood samples from finger-puncture, collected on DNA-cards, has been described as an alternative to the usual venepuncture technique. The purpose of this study is to evaluate the implementation of the DNA cards method in a multicentre clinical trial, and to assess the degree of investigators' satisfaction and the acceptance of the patients perceived by the investigators. METHODS: Blood samples were collected on DNA-cards. The quality and quantity of DNA recovered were analyzed. Investigators were questioned regarding their general interest, previous experience, safety issues, preferences and perceived patient satisfaction. RESULTS: 151 patients' blood samples were collected. Genotyping of GST polymorphisms was achieved in all samples (100%). 28 investigators completed the survey. Investigators perceived patient satisfaction as very good (60.7%) or good (39.3%), without reluctance to finger puncture. Investigators preferred this method, which was considered safer and better than the usual methods. All investigators would recommend using it in future genetic studies. CONCLUSION: Within the clinical trial setting, the DNA-cards method was very well accepted by investigators and patients (in perception of investigators), and was preferred to conventional methods due to its ease of use and safety. [Abstract/Link to Full Text]

Cheyssac C, Lecoeur C, Dechaume A, Bibi A, Charpentier G, Balkau B, Marre M, Froguel P, Gibson F, Vaxillaire M
Analysis of common PTPN1 gene variants in type 2 diabetes, obesity and associated phenotypes in the French population.
BMC Med Genet. 2006;744.
BACKGROUND: The protein tyrosine phosphatase-1B, a negative regulator for insulin and leptin signalling, potentially modulates glucose and energy homeostasis. PTP1B is encoded by the PTPN1 gene located on chromosome 20q13 showing linkage with type 2 diabetes (T2D) in several populations. PTPN1 gene variants have been inconsistently associated with T2D, and the aim of our study was to investigate the effect of PTPN1 genetic variations on the risk of T2D, obesity and on the variability of metabolic phenotypes in the French population. METHODS: Fourteen single nucleotide polymorphisms (SNPs) spanning the PTPN1 locus were selected from previous association reports and from HapMap linkage disequilibrium data. SNPs were evaluated for association with T2D in two case-control groups with 1227 cases and 1047 controls. Association with moderate and severe obesity was also tested in a case-control study design. Association with metabolic traits was evaluated in 736 normoglycaemic, non-obese subjects from a general population. Five SNPs showing a trend towards association with T2D, obesity or metabolic parameters were investigated for familial association. RESULTS: From 14 SNPs investigated, only SNP rs914458, located 10 kb downstream of the PTPN1 gene significantly associated with T2D (p = 0.02 under a dominant model; OR = 1.43 [1.06-1.94]) in the combined sample set. SNP rs914458 also showed association with moderate obesity (allelic p = 0.04; OR = 1.2 [1.01-1.43]). When testing for association with metabolic traits, two strongly correlated SNPs, rs941798 and rs2426159, present multiple consistent associations. SNP rs2426159 exhibited evidence of association under a dominant model with glucose homeostasis related traits (p = 0.04 for fasting insulin and HOMA-B) and with lipid markers (0.02 = p = 0.04). Moreover, risk allele homozygotes for this SNP had an increased systolic blood pressure (p = 0.03). No preferential transmission of alleles was observed for the SNPs tested in the family sample. CONCLUSION: In our study, PTPN1 variants showed moderate association with T2D and obesity. However, consistent associations with metabolic variables reflecting insulin resistance and dyslipidemia are found for two intronic SNPs as previously reported. Thus, our data indicate that PTPN1 variants may modulate the lipid profile, thereby influencing susceptibility to metabolic disease. [Abstract/Link to Full Text]

Alsmadi OA, Al-Kayal F, Al-Hamed M, Meyer BF
Frequency of common HFE variants in the Saudi population: a high throughput molecular beacon-based study.
BMC Med Genet. 2006;743.
BACKGROUND: Hereditary Hemochromatosis (HH) is an autosomal recessive disorder highlighted by iron-overload. Two popular mutations in HFE, p.C282Y and p.H63D, have been discovered and found to associate with HH in different ethnic backgrounds. p.C282Y and p.H63D diagnosis is usually made by restriction enzyme analysis. However, the use of this technique is largely limited to research laboratories because they are relatively expensive, time-consuming, and difficult to transform into a high throughput format. METHODS: Single nucleotide variations in target DNA sequences can be readily identified using molecular beacon fluorescent probes. These are quenched probes with loop and hairpin structure, and they become fluorescent upon specific target recognition. We developed high throughput homogeneous real-time PCR assays using molecular beacon technology, to genotype p.C282Y and p.H63D variants. Representative samples of different genotypes for these variants were assayed by restriction enzyme analysis and direct sequencing as bench mark methods for comparison with the newly developed molecular beacon-based real-time PCR assay. RESULTS: Complete concordance was achieved by all three assay formats. Homozygotes (mutant and wildtype) and heterozygotes were readily differentiated by the allele specific molecular beacons as reported by the associated fluorophore in the real-time assay developed in this study. Additionally, these assays were used in a high throughput format to establish the allele frequency of C282Y and H63D in Saudis for the first time. CONCLUSION: These assays may be reliably applied as a diagnostic test or large scale method for population screening. [Abstract/Link to Full Text]

Prasad P, Tiwari AK, Kumar KM, Ammini AC, Gupta A, Gupta R, Sharma AK, Rao AR, Nagendra R, Chandra TS, Tiwari SC, Rastogi P, Gupta BL, Thelma BK
Chronic renal insufficiency among Asian Indians with type 2 diabetes: I. Role of RAAS gene polymorphisms.
BMC Med Genet. 2006;742.
BACKGROUND: Renal failure in diabetes is mediated by multiple pathways. Experimental and clinical evidences suggest that renin-angiotensin-aldosterone system (RAAS) has a crucial role in diabetic kidney disease. A relationship between the RAAS genotypes and chronic renal insufficiency (CRI) among type 2 diabetes subjects has therefore been speculated. We investigated the contribution of selected RAAS gene polymorphisms to CRI among type 2 diabetic Asian Indian subjects. METHODS: Twelve single nucleotide polymorphisms (SNPs) from six genes namely-renin (REN), angiotensinogen (ATG), angiotensin converting enzyme I (ACE), angiotensin II type 1 receptor (AT1) and aldosterone synthase (CYP11B2) gene from the RAAS pathway and one from chymase pathway were genotyped using polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP) method and tested for their association with diabetic CRI using a case-control approach. Successive cases presenting to study centres with type 2 diabetes of > or =2 years duration and moderate CRI diagnosed by serum creatinine > or =3 mg/dl after exclusion of non-diabetic causes of CRI (n = 196) were compared with diabetes subjects with no evidence of renal disease (n = 225). Logistic regression analysis was carried out to correlate various clinical parameters with genotypes, and to study pair wise interactions between SNPs of different genes. RESULTS: Of the 12 SNPs genotyped, Glu53Stop in AGT and A>T (-777) in AT1 genes, were monomorphic and not included for further analysis. We observed a highly significant association of Met235Thr SNP in angiotensinogen gene with CRI (O.R. 2.68, 95%CI: 2.01-3.57 for Thr allele, O.R. 2.94, 95%CI: 1.88-4.59 for Thr/Thr genotype and O.R. 2.68, 95%CI: 1.97-3.64 for ACC haplotype). A significant allelic and genotypic association of T>C (-344) SNP in aldosterone synthase gene (O.R. 1.57, 95%CI: 1.16-2.14 and O.R. 1.81, 95%CI: 1.21-2.71 respectively), and genotypic association of GA genotype of G>A (-1903) in chymase gene (O.R. 2.06, 95%CI: 1.34-3.17) were also observed. CONCLUSION: SNPs Met235Thr in angiotensinogen, T>C (-344) in aldosterone synthase, and G>A (-1903) in chymase genes are significantly associated with diabetic chronic renal insufficiency in Indian patients and warrant replication in larger sample sets. Use of such markers for prediction of susceptibility to diabetes specific renal disease in the ethnically Indian population appears promising. [Abstract/Link to Full Text]

Mayeur H, Roche O, V�tu C, Jaliffa C, Marchant D, Dollfus H, Bonneau D, Munier FL, Schorderet DF, Levin AV, H�on E, Sutherland J, Lacombe D, Said E, Mezer E, Kaplan J, Dufier JL, Marsac C, Menasche M, Abitbol M
Eight previously unidentified mutations found in the OA1 ocular albinism gene.
BMC Med Genet. 2006;741.
BACKGROUND: Ocular albinism type 1 (OA1) is an X-linked ocular disorder characterized by a severe reduction in visual acuity, nystagmus, hypopigmentation of the retinal pigmented epithelium, foveal hypoplasia, macromelanosomes in pigmented skin and eye cells, and misrouting of the optical tracts. This disease is primarily caused by mutations in the OA1 gene. METHODS: The ophthalmologic phenotype of the patients and their family members was characterized. We screened for mutations in the OA1 gene by direct sequencing of the nine PCR-amplified exons, and for genomic deletions by PCR-amplification of large DNA fragments. RESULTS: We sequenced the nine exons of the OA1 gene in 72 individuals and found ten different mutations in seven unrelated families and three sporadic cases. The ten mutations include an amino acid substitution and a premature stop codon previously reported by our team, and eight previously unidentified mutations: three amino acid substitutions, a duplication, a deletion, an insertion and two splice-site mutations. The use of a novel Taq polymerase enabled us to amplify large genomic fragments covering the OA1 gene. and to detect very likely six distinct large deletions. Furthermore, we were able to confirm that there was no deletion in twenty one patients where no mutation had been found. CONCLUSION: The identified mutations affect highly conserved amino acids, cause frameshifts or alternative splicing, thus affecting folding of the OA1 G protein coupled receptor, interactions of OA1 with its G protein and/or binding with its ligand. [Abstract/Link to Full Text]

Natividad A, Cooke G, Holland MJ, Burton MJ, Joof HM, Rockett K, Kwiatkowski DP, Mabey DC, Bailey RL
A coding polymorphism in matrix metalloproteinase 9 reduces risk of scarring sequelae of ocular Chlamydia trachomatis infection.
BMC Med Genet. 2006;740.
BACKGROUND: Trachoma, an infectious disease of the conjunctiva caused by Chlamydia trachomatis, is an important global cause of blindness. A dysregulated extracellular matrix (ECM) proteolysis during the processes of tissue repair following infection and inflammation are thought to play a key role in the development of fibrotic sequelae of infection, which ultimately leads to blindness. Expression and activity of matrix metalloproteinase 9 (MMP-9), a major effector of ECM turnover, is up-regulated in the inflamed conjunctiva of trachoma subjects. Genetic variation within the MMP9 gene affects in vitro MMP9 expression levels, enzymatic activity and susceptibility to various inflammatory and fibrotic conditions. METHODS: We genotyped 651 case-control pairs from trachoma endemic villages in The Gambia for coding single nucleotide polymorphisms (SNPs) in the MMP9 gene using the high-throughput Sequenom system. Single marker and haplotype conditional logistic regression (CLR) analysis for disease association was performed. RESULTS: The Q279R mutation located in exon 6 of MMP9 was found to be associated with lower risk for severe disease sequelae of ocular Chlamydia trachomatis infection. This mutation, which leads to a nonsynonymous amino-acid change within the active site of the enzyme may reduce MMP-9-induced degradation of the structural components of the ECM during inflammatory episodes in trachoma and its associated fibrosis. CONCLUSION: This work supports the hypothesis that MMP-9 has a role in the pathogenesis of blinding trachoma. [Abstract/Link to Full Text]

Asselbergs FW, Moore JH, van den Berg MP, Rimm EB, de Boer RA, Dullaart RP, Navis G, van Gilst WH
A role for CETP TaqIB polymorphism in determining susceptibility to atrial fibrillation: a nested case control study.
BMC Med Genet. 2006;739.
BACKGROUND: Studies investigating the genetic and environmental characteristics of atrial fibrillation (AF) may provide new insights in the complex development of AF. We aimed to investigate the association between several environmental factors and loci of candidate genes, which might be related to the presence of AF. METHODS: A nested case-control study within the PREVEND cohort was conducted. Standard 12 lead electrocardiograms were recorded and AF was defined according to Minnesota codes. For every case, an age and gender matched control was selected from the same population (n = 194). In addition to logistic regression analyses, the multifactor-dimensionality reduction (MDR) method and interaction entropy graphs were used for the evaluation of gene-gene and gene-environment interactions. Polymorphisms in genes from the Renin-angiotensin, Bradykinin and CETP systems were included. RESULTS: Subjects with AF had a higher prevalence of electrocardiographic left ventricular hypertrophy, ischemic heart disease, hypertension, renal dysfunction, elevated levels of C-reactive protein (CRP) and increased urinary albumin excretion as compared to controls. The polymorphisms of the Renin-angiotensin system and Bradykinin gene did not show a significant association with AF (p > 0.05). The TaqIB polymorphism of the CETP gene was significantly associated with the presence of AF (p < 0.05). Using the MDR method, the best genotype-phenotype models included the combination of micro- or macroalbuminuria and CETP TaqIB polymorphism, CRP >3 mg/L and CETP TaqIB polymorphism, renal dysfunction and the CETP TaqIB polymorphism, and ischemic heart disease and CETP TaqIB polymorphism (1000 fold permutation testing, P < 0.05). Interaction entropy graph showed that the combination of albuminuria and CETP TaqIB polymorphism removed the most entropy. CONCLUSION: CETP TaqIB polymorphism is significantly associated with the presence of AF in the context of micro- or macroalbuminuria, elevated C-reactive protein, renal dysfunction, and ischemic heart disease. [Abstract/Link to Full Text]

Abu-Amero KK, Al-Boudari OM, Mohamed GH, Dzimiri N
T null and M null genotypes of the glutathione S-transferase gene are risk factor for CAD independent of smoking.
BMC Med Genet. 2006;738.
BACKGROUND: The association of the deletion in GSTT1 and GSTM1 genes with coronary artery disease (CAD) among smokers is controversial. In addition, no such investigation has previously been conducted among Arabs. METHODS: We genotyped 1054 CAD patients and 762 controls for GSTT1 and GSTM1 deletion by multiplex polymerase chain reaction. Both CAD and controls were Saudi Arabs. RESULTS: In the control group (n = 762), 82.3% had the T wild M wildgenotype, 9% had the Twild M null, 2.4% had the Tnull M wild and 6.3% had the Tnull M null genotype. Among the CAD group (n = 1054), 29.5% had the Twild M wild genotype, 26.6% (p < .001) had the Twild M null, 8.3% (p < .001) had the Tnull M wild and 35.6% (p < .001) had the Tnull M null genotype, indicating a significant association of the Twild M null, Tnull M wild and Tnull M null genotypes with CAD. Univariate analysis also showed that smoking, age, hypercholesterolemia and hypertriglyceridemia, diabetes mellitus, family history of CAD, hypertension and obesity are all associated with CAD, whereas gender and myocardial infarction are not. Binary logistic regression for smoking and genotypes indicated that only M null and Tnullare interacting with smoking. However, further subgroup analysis stratifying the data by smoking status suggested that genotype-smoking interactions have no effect on the development of CAD. CONCLUSION: GSTT1 and GSTM1 null-genotypes are risk factor for CAD independent of genotype-smoking interaction. [Abstract/Link to Full Text]

Kashyap VK, Sahoo S, Sitalaximi T, Trivedi R
Deletions in the Y-derived amelogenin gene fragment in the Indian population.
BMC Med Genet. 2006;737.
BACKGROUND: Rare failures in amelogenin-based gender typing of individuals have been observed globally. In this study, we report the deletion of a large fragment of the amelogenin gene in 10 individuals out of 4,257 male samples analyzed from 104 different endogamous populations of India. METHODS: Samples were analyzed using commercial genetic profiling kits. Those that exhibited failures in amelogenin-based gender identification were further analyzed with published as well as newly designed primers to ascertain the nature and extent of mutation. RESULTS: The failure rate among Indian males was 0.23 %. Though the exact size and nature of the deletion (single point mutations at a number of positions or a single large deletion) could not be determined in the present study, it is inferred that the deletion spans a region downstream of the reverse primer-binding site of commercially available amelogenin primer sets. Deletions were conspicuously absent among the Mongoloid tribes of Northeast India, while both caste and tribal groups harbored these mutations, which was predominantly among the Y-chromosomes belonging to J2 lineage. CONCLUSION: Our study indicates that the different amelogenin primer sets currently included in genetic profiling multiplex kits may result in erroneous interpretations due to mutations undetectable during routine testing. Further there are indications that these mutations could possibly be lineage-specific, inherited deletions. [Abstract/Link to Full Text]

Lin TC, Yen JM, Gong KB, Kuo TC, Ku DC, Liang SF, Wu MJ
Abnormal glucose tolerance and insulin resistance in polycystic ovary syndrome amongst the Taiwanese population- not correlated with insulin receptor substrate-1 Gly972Arg/Ala513Pro polymorphism.
BMC Med Genet. 2006;736.
BACKGROUND: Insulin resistance and glucose dysmetabolism in polycystic ovary syndrome (PCOS) are related with the polymorphisms in the genes encoding the insulin receptor substrate (IRS) proteins, especially Gly972Arg/Ala513Pro polymorphism being reported to be associated with type-2 diabetes and PCOS. We intended to assess the prevalence of abnormal glucose tolerance (AGT) and insulin resistance in Taiwanese PCOS women. We also tried to assess whether the particular identity of Gly972Arg/Ala513Pro polymorphic alleles of the IRS-1 gene mutation can be used as an appropriate diagnostic indicator for PCOS. METHODS: We designed a prospective clinical study. Forty-seven Taiwanese Hoklo and Hakka women, diagnosed with PCOS were enrolled in this study as were forty-five healthy Hoklo and Hakka women as the control group. Insulin resistance was evaluated with fasting insulin, fasting glucose/insulin ratio, and homeostasis model assessment index for insulin resistance (HOMAIR). The genomic DNA of the subjects was amplified by PCR and digested by restriction fragmented length polymorphism (RFLP) with Bst N1 used for codon 972 and Dra III for codon 513. RESULTS: AGT was found in 46.8% of these PCOS patients and was significantly related to high insulin resistance rather than the low insulin resistance. Those patients with either insulin resistance or AGT comprised the majority of PCOS affected patients (AGT + fasting insulin > or =17: 83%, AGT + glucose/insulin ratio > or =6.5: 85.1%, AGT + HOMAIR > or = 2: 87.2%, and AGT + HOMAIR > or = 3.8: 72.3%). None of the tested samples revealed any polymorphism due to the absence of any Dra III recognition site or any Bst N1 recognition site in the amplified PCR fragment digested by restriction fragmented length polymorphism. CONCLUSION: There is significantly high prevalence of AGT and insulin resistance in PCOS women, but Gly972Arg and Ala513Pro polymorphic alleles of IRS-1 are rare and are not associated with the elevated risk of PCOS amongst Taiwanese subjects. This is quite different from the similar study in phylogenetically diverged Caucasian subjects. [Abstract/Link to Full Text]

Gamundi MJ, Hernan I, Mart�nez-Gimeno M, Maseras M, Garc�a-Sandoval B, Ayuso C, Anti�olo G, Baiget M, Carballo M
Three novel and the common Arg677Ter RP1 protein truncating mutations causing autosomal dominant retinitis pigmentosa in a Spanish population.
BMC Med Genet. 2006;735.
BACKGROUND: Retinitis pigmentosa (RP), a clinically and genetically heterogeneous group of retinal degeneration disorders affecting the photoreceptor cells, is one of the leading causes of genetic blindness. Mutations in the photoreceptor-specific gene RP1 account for 3-10% of cases of autosomal dominant RP (adRP). Most of these mutations are clustered in a 500 bp region of exon 4 of RP1. METHODS: Denaturing gradient gel electrophoresis (DGGE) analysis and direct genomic sequencing were used to evaluate the 5' coding region of exon 4 of the RP1 gene for mutations in 150 unrelated index adRP patients. Ophthalmic and electrophysiological examination of RP patients and relatives according to pre-existing protocols were carried out. RESULTS: Three novel disease-causing mutations in RP1 were detected: Q686X, K705fsX712 and K722fsX737, predicting truncated proteins. One novel missense mutation, Thr752Met, was detected in one family but the mutation does not co-segregate in the family, thereby excluding this amino acid variation in the protein as a cause of the disease. We found the Arg677Ter mutation, previously reported in other populations, in two independent families, confirming that this mutation is also present in a Spanish population. CONCLUSION: Most of the mutations reported in the RP1 gene associated with adRP are expected to encode mutant truncated proteins that are approximately one third or half of the size of wild type protein. Patients with mutations in RP1 showed mild RP with variability in phenotype severity. We also observed several cases of non-penetrant mutations. [Abstract/Link to Full Text]

Macgregor S, Khan IA
GAIA: an easy-to-use web-based application for interaction analysis of case-control data.
BMC Med Genet. 2006;734.
BACKGROUND: The advent of cheap, large scale genotyping has led to widespread adoption of genetic association mapping as the tool of choice in the search for loci underlying susceptibility to common complex disease. Whilst simple single locus analysis is relatively trivial to conduct, this is not true of more complex analysis such as those involving interactions between loci. The importance of testing for interactions between loci in association analysis has been highlighted in a number of recent high profile publications. RESULTS: Genetic Association Interaction Analysis (GAIA) is a web-based application for testing for statistical interactions between loci. It is based upon the widely used case-control study design for genetic association analysis and is designed so that non-specialists may routinely apply tests for interaction. GAIA allows simple testing of both additive and additive plus dominance interaction models and includes permutation testing to appropriately correct for multiple testing. The application will find use both in candidate gene based studies and in genome-wide association studies. For large scale studies GAIA includes a screening approach which prioritizes loci (based on the significance of main effects at one or both loci) for further interaction analysis. CONCLUSION: GAIA is available at http://www.bbu.cf.ac.uk/html/research/biostats.htm. [Abstract/Link to Full Text]

Homanics GE, Skvorak K, Ferguson C, Watkins S, Paul HS
Production and characterization of murine models of classic and intermediate maple syrup urine disease.
BMC Med Genet. 2006;733.
BACKGROUND: Maple Syrup Urine Disease (MSUD) is an inborn error of metabolism caused by a deficiency of branched-chain keto acid dehydrogenase. MSUD has several clinical phenotypes depending on the degree of enzyme deficiency. Current treatments are not satisfactory and require new approaches to combat this disease. A major hurdle in developing new treatments has been the lack of a suitable animal model. METHODS: To create a murine model of classic MSUD, we used gene targeting and embryonic stem cell technologies to create a mouse line that lacked a functional E2 subunit gene of branched-chain keto acid dehydrogenase. To create a murine model of intermediate MSUD, we used transgenic technology to express a human E2 cDNA on the knockout background. Mice of both models were characterized at the molecular, biochemical, and whole animal levels. RESULTS: By disrupting the E2 subunit gene of branched-chain keto acid dehydrogenase, we created a gene knockout mouse model of classic MSUD. The homozygous knockout mice lacked branched-chain keto acid dehydrogenase activity, E2 immunoreactivity, and had a 3-fold increase in circulating branched-chain amino acids. These metabolic derangements resulted in neonatal lethality. Transgenic expression of a human E2 cDNA in the liver of the E2 knockout animals produced a model of intermediate MSUD. Branched-chain keto acid dehydrogenase activity was 5-6% of normal and was sufficient to allow survival, but was insufficient to normalize circulating branched-chain amino acids levels, which were intermediate between wildtype and the classic MSUD mouse model. CONCLUSION: These mice represent important animal models that closely approximate the phenotype of humans with the classic and intermediate forms of MSUD. These animals provide useful models to further characterize the pathogenesis of MSUD, as well as models to test novel therapeutic strategies, such as gene and cellular therapies, to treat this devastating metabolic disease. [Abstract/Link to Full Text]

N��ez C, Alecsandru D, Varad� J, Polanco I, Maluenda C, Fern�ndez-Arquero M, de la Concha EG, Urcelay E, Mart�nez A
Interleukin-10 haplotypes in Celiac Disease in the Spanish population.
BMC Med Genet. 2006;732.
BACKGROUND: Celiac disease (CD) is a chronic disorder characterized by a pathological inflammatory response after exposure to gluten in genetically susceptible individuals. The HLA complex accounts for less than half of the genetic component of the disease, and additional genes must be implicated. Interleukin-10 (IL-10) is an important regulator of mucosal immunity, and several reports have described alterations of IL-10 levels in celiac patients. The IL-10 gene is located on chromosome 1, and its promoter carries several single nucleotide polymorphisms (SNPs) and microsatellites which have been associated to production levels. Our aim was to study the role of those polymorphisms in susceptibility to CD in our population. METHODS: A case-control and a familial study were performed. Positions -1082, -819 and -592 of the IL-10 promoter were typed by TaqMan and allele specific PCR. IL10R and IL10G microsatellites were amplified with labelled primers, and they were subsequently run on an automatic sequencer. In this study 446 patients and 573 controls were included, all of them white Spaniards. Extended haplotypes encompassing microsatellites and SNPs were obtained in families and estimated in controls by the Expectation-Maximization algorithm. RESULTS: No significant associations after Bonferroni correction were observed in the SNPs or any of the microsatellites. Stratification by HLA-DQ2 (DQA1*0501-DQB1*02) status did not alter the results. When extended haplotypes were analyzed, no differences were apparent either. CONCLUSION: The IL-10 polymorphisms studied are not associated with celiac disease. Our data suggest that the IL-10 alteration seen in patients may be more consequence than cause of the disease. [Abstract/Link to Full Text]

Abu-Amero KK, Al-Boudari OM, Mohamed GH, Dzimiri N
The Glu27 genotypes of the beta2-adrenergic receptor are predictors for severe coronary artery disease.
BMC Med Genet. 2006;731.
BACKGROUND: The role of the Beta2-adrenoceptor (beta2-AR) Gln27Glu polymorphism in the manifestation of cardiovascular diseases is still unclear. METHODS: In the present study, we evaluated the potential relevance of the c.79 C>G (p.Gln27Glu) polymorphism of this receptor gene for coronary artery disease (CAD) and its associated risk factors in Saudi Arabs. Genotyping was performed by PCR using the confronting two-pair primer (PCR-CTPP) method. RESULTS: In the general population group (BD) (n = 895), 68.5% were homozygous wild-type C/C, 28.3% were heterozygous C/G and 3.2% were homozygous mutant G/G. Among the CAD patients (n = 773), 50.6% were homozygous wild-type C/C, 43.6% were heterozygous C/G and 5.8% were homozygous mutant G/G, while in the angiographed control group (CON) (n = 528), 71.8% were C/C, 24.4% C/G and 3.8% G/G genotypes. These results indicate that both the C/G (p = or < .001) and G/G (p = .005) genotypes are significantly associated with CAD, when compared to the CON group. In addition, C/G (p = or < .001) and G/G (p = or < .001) were significantly associated with CAD, when compared to the BD group. Furthermore, stepwise logistic regression showed that the genotype [C/G (p < .001) and G/G (p < .001)] increase the risk of CAD. CONCLUSION: These results shows that the Gln27Glu genotypes (homo- or heterozygous) of the beta2-AR may be independent predictors of severe CAD. [Abstract/Link to Full Text]

Recent Articles in American Journal of Human Genetics

Agrawal PB, Greenleaf RS, Tomczak KK, Lehtokari VL, Wallgren-Pettersson C, Wallefeld W, Laing NG, Darras BT, Maciver SK, Dormitzer PR, Beggs AH
Nemaline myopathy with minicores caused by mutation of the CFL2 gene encoding the skeletal muscle actin-binding protein, cofilin-2.
Am J Hum Genet. 2007 Jan;80(1):162-7.
Nemaline myopathy (NM) is a congenital myopathy characterized by muscle weakness and nemaline bodies in affected myofibers. Five NM genes, all encoding components of the sarcomeric thin filament, are known. We report identification of a sixth gene, CFL2, encoding the actin-binding protein muscle cofilin-2, which is mutated in two siblings with congenital myopathy. The proband's muscle contained characteristic nemaline bodies, as well as occasional fibers with minicores, concentric laminated bodies, and areas of F-actin accumulation. Her affected sister's muscle was reported to exhibit nonspecific myopathic changes. Cofilin-2 levels were significantly lower in the proband's muscle, and the mutant protein was less soluble when expressed in Escherichia coli, suggesting that deficiency of cofilin-2 may result in reduced depolymerization of actin filaments, causing their accumulation in nemaline bodies, minicores, and, possibly, concentric laminated bodies. [Abstract/Link to Full Text]

Valdmanis PN, Meijer IA, Reynolds A, Lei A, MacLeod P, Schlesinger D, Zatz M, Reid E, Dion PA, Drapeau P, Rouleau GA
Mutations in the KIAA0196 gene at the SPG8 locus cause hereditary spastic paraplegia.
Am J Hum Genet. 2007 Jan;80(1):152-61.
Hereditary spastic paraplegia (HSP) is a progressive upper-motor neurodegenerative disease. The eighth HSP locus, SPG8, is on chromosome 8p24.13. The three families previously linked to the SPG8 locus present with relatively severe, pure spastic paraplegia. We have identified three mutations in the KIAA0196 gene in six families that map to the SPG8 locus. One mutation, V626F, segregated in three large North American families with European ancestry and in one British family. An L619F mutation was found in a Brazilian family. The third mutation, N471D, was identified in a smaller family of European origin and lies in a spectrin domain. None of these mutations were identified in 500 control individuals. Both the L619 and V626 residues are strictly conserved across species and likely have a notable effect on the structure of the protein product strumpellin. Rescue studies with human mRNA injected in zebrafish treated with morpholino oligonucleotides to knock down the endogenous protein showed that mutations at these two residues impaired the normal function of the KIAA0196 gene. However, the function of the 1,159-aa strumpellin protein is relatively unknown. The identification and characterization of the KIAA0196 gene will enable further insight into the pathogenesis of HSP. [Abstract/Link to Full Text]

Upadhyaya M, Huson SM, Davies M, Thomas N, Chuzhanova N, Giovannini S, Evans DG, Howard E, Kerr B, Griffiths S, Consoli C, Side L, Adams D, Pierpont M, Hachen R, Barnicoat A, Li H, Wallace P, Van Biervliet JP, Stevenson D, Viskochil D, Baralle D, Haan E, Riccardi V, Turnpenny P, Lazaro C, Messiaen L
An absence of cutaneous neurofibromas associated with a 3-bp inframe deletion in exon 17 of the NF1 gene (c.2970-2972 delAAT): evidence of a clinically significant NF1 genotype-phenotype correlation.
Am J Hum Genet. 2007 Jan;80(1):140-51.
Neurofibromatosis type 1 (NF1) is characterized by cafe-au-lait spots, skinfold freckling, and cutaneous neurofibromas. No obvious relationships between small mutations (<20 bp) of the NF1 gene and a specific phenotype have previously been demonstrated, which suggests that interaction with either unlinked modifying genes and/or the normal NF1 allele may be involved in the development of the particular clinical features associated with NF1. We identified 21 unrelated probands with NF1 (14 familial and 7 sporadic cases) who were all found to have the same c.2970-2972 delAAT (p.990delM) mutation but no cutaneous neurofibromas or clinically obvious plexiform neurofibromas. Molecular analysis identified the same 3-bp inframe deletion (c.2970-2972 delAAT) in exon 17 of the NF1 gene in all affected subjects. The Delta AAT mutation is predicted to result in the loss of one of two adjacent methionines (codon 991 or 992) ( Delta Met991), in conjunction with silent ACA-->ACG change of codon 990. These two methionine residues are located in a highly conserved region of neurofibromin and are expected, therefore, to have a functional role in the protein. Our data represent results from the first study to correlate a specific small mutation of the NF1 gene to the expression of a particular clinical phenotype. The biological mechanism that relates this specific mutation to the suppression of cutaneous neurofibroma development is unknown. [Abstract/Link to Full Text]

Pearson JV, Huentelman MJ, Halperin RF, Tembe WD, Melquist S, Homer N, Brun M, Szelinger S, Coon KD, Zismann VL, Webster JA, Beach T, Sando SB, Aasly JO, Heun R, Jessen F, Kolsch H, Tsolaki M, Daniilidou M, Reiman EM, Papassotiropoulos A, Hutton ML, Stephan DA, Craig DW
Identification of the genetic basis for complex disorders by use of pooling-based genomewide single-nucleotide-polymorphism association studies.
Am J Hum Genet. 2007 Jan;80(1):126-39.
We report the development and validation of experimental methods, study designs, and analysis software for pooling-based genomewide association (GWA) studies that use high-throughput single-nucleotide-polymorphism (SNP) genotyping microarrays. We first describe a theoretical framework for establishing the effectiveness of pooling genomic DNA as a low-cost alternative to individually genotyping thousands of samples on high-density SNP microarrays. Next, we describe software called "GenePool," which directly analyzes SNP microarray probe intensity data and ranks SNPs by increased likelihood of being genetically associated with a trait or disorder. Finally, we apply these methods to experimental case-control data and demonstrate successful identification of published genetic susceptibility loci for a rare monogenic disease (sudden infant death with dysgenesis of the testes syndrome), a rare complex disease (progressive supranuclear palsy), and a common complex disease (Alzheimer disease) across multiple SNP genotyping platforms. On the basis of these theoretical calculations and their experimental validation, our results suggest that pooling-based GWA studies are a logical first step for determining whether major genetic associations exist in diseases with high heritability. [Abstract/Link to Full Text]

Zheng M, McPeek MS
Multipoint linkage-disequilibrium mapping with haplotype-block structure.
Am J Hum Genet. 2007 Jan;80(1):112-25.
The HapMap Project is providing a great deal of new information on high-resolution haplotype structure in various human populations. This information has the potential to greatly increase the power of association mapping for a fixed amount of genotyping. A number of methods have been proposed for the identification of haplotype blocks, common haplotypes, and tagging single-nucleotide polymorphisms. Here, we build on this work by developing novel methods for case-control multipoint linkage-disequilibrium (LD) mapping that gain power and speed by making explicit use of the inferred block structure. Specifically, we developed a virtual-variant approach that uses the haplotype-block information to greatly increase power for detection of untyped common variants associated with a trait. Because full multipoint LD mapping can be slow, we exploited the haplotype-block information to develop a fast single-block multipoint mapping method. Our methods are appropriate for genotype data and take into account the uncertainty in phase. We describe the methods in the context of case-parents trios, although they are also applicable to unrelated cases and controls. Our simulations indicate that the most important gains from taking into account the haplotype-block structure at the analysis stage of multipoint LD mapping come from (1) greatly increased power to detect association with untyped variants and (2) greatly improved localization of untyped variants associated with the trait. More-modest gains are obtained in improving power to detect association with a variant that is typed with a moderate amount of missing data. The methods are applied to a Crohn disease data set. [Abstract/Link to Full Text]

Naveed M, Nath SK, Gaines M, Al-Ali MT, Al-Khaja N, Hutchings D, Golla J, Deutsch S, Bottani A, Antonarakis SE, Ratnamala U, Radhakrishna U
Genomewide linkage scan for split-hand/foot malformation with long-bone deficiency in a large Arab family identifies two novel susceptibility loci on chromosomes 1q42.2-q43 and 6q14.1.
Am J Hum Genet. 2007 Jan;80(1):105-11.
Split-hand/foot malformation with long-bone deficiency (SHFLD) is a rare, severe limb deformity characterized by tibia aplasia with or without split-hand/split-foot deformity. Identification of genetic susceptibility loci for SHFLD has been unsuccessful because of its rare incidence, variable phenotypic expression and associated anomalies, and uncertain inheritance pattern. SHFLD is usually inherited as an autosomal dominant trait with reduced penetrance, although recessive inheritance has also been postulated. We conducted a genomewide linkage analysis, using a 10K SNP array in a large consanguineous family (UR078) from the United Arab Emirates (UAE) who had disease transmission consistent with an autosomal dominant inheritance pattern. The study identified two novel SHFLD susceptibility loci at 1q42.2-q43 (nonparametric linkage [NPL] 9.8, P=.000065) and 6q14.1 (NPL 7.12, P=.000897). These results were also supported by multipoint parametric linkage analysis. Maximum multipoint LOD scores of 3.20 and 3.78 were detected for genomic locations 1q42.2-43 and 6q14.1, respectively, with the use of an autosomal dominant mode of inheritance with reduced penetrance. Haplotype analysis with informative crossovers enabled mapping of the SHFLD loci to a region of approximately 18.38 cM (8.4 Mb) between single-nucleotide polymorphisms rs1124110 and rs535043 on 1q42.2-q43 and to a region of approximately 1.96 cM (4.1 Mb) between rs623155 and rs1547251 on 6q14.1. The study identified two novel loci for the SHFLD phenotype in this UAE family. [Abstract/Link to Full Text]

Wong KK, deLeeuw RJ, Dosanjh NS, Kimm LR, Cheng Z, Horsman DE, MacAulay C, Ng RT, Brown CJ, Eichler EE, Lam WL
A comprehensive analysis of common copy-number variations in the human genome.
Am J Hum Genet. 2007 Jan;80(1):91-104.
Segmental copy-number variations (CNVs) in the human genome are associated with developmental disorders and susceptibility to diseases. More importantly, CNVs may represent a major genetic component of our phenotypic diversity. In this study, using a whole-genome array comparative genomic hybridization assay, we identified 3,654 autosomal segmental CNVs, 800 of which appeared at a frequency of at least 3%. Of these frequent CNVs, 77% are novel. In the 95 individuals analyzed, the two most diverse genomes differed by at least 9 Mb in size or varied by at least 266 loci in content. Approximately 68% of the 800 polymorphic regions overlap with genes, which may reflect human diversity in senses (smell, hearing, taste, and sight), rhesus phenotype, metabolism, and disease susceptibility. Intriguingly, 14 polymorphic regions harbor 21 of the known human microRNAs, raising the possibility of the contribution of microRNAs to phenotypic diversity in humans. This in-depth survey of CNVs across the human genome provides a valuable baseline for studies involving human genetics. [Abstract/Link to Full Text]

Hegele RA
Copy-number variations and human disease.
Am J Hum Genet. 2007 Aug;81(2):414-5; author reply 415. [Abstract/Link to Full Text]

Lynch AG, Marioni JC, Tavar� S
Numbers of copy-number variations and false-negative rates will be underestimated if we do not account for the dependence between repeated experiments.
Am J Hum Genet. 2007 Aug;81(2):418-20; author reply 420-1. [Abstract/Link to Full Text]

Jakobsdottir J, Weeks DE
Estimating prevalence, false-positive rate, and false-negative rate with use of repeated testing when true responses are unknown.
Am J Hum Genet. 2007 Nov;81(5):1111-3. [Abstract/Link to Full Text]

Shi M, Christensen K, Weinberg CR, Romitti P, Bathum L, Lozada A, Morris RW, Lovett M, Murray JC
Orofacial cleft risk is increased with maternal smoking and specific detoxification-gene variants.
Am J Hum Genet. 2007 Jan;80(1):76-90.
Maternal smoking is a recognized risk factor for orofacial clefts. Maternal or fetal pharmacogenetic variants are plausible modulators of this risk. In this work, we studied 5,427 DNA samples, including 1,244 from subjects in Denmark and Iowa with facial clefting and 4,183 from parents, siblings, or unrelated population controls. We examined 25 single-nucleotide polymorphisms in 16 genes in pathways for detoxification of components of cigarette smoke, to look for evidence of gene-environment interactions. For genes identified as related to oral clefting, we studied gene-expression profiles in fetal development in the relevant tissues and time intervals. Maternal smoking was a significant risk factor for clefting and showed dosage effects, in both the Danish and Iowan data. Suggestive effects of variants in the fetal NAT2 and CYP1A1 genes were observed in both the Iowan and the Danish participants. In an expanded case set, NAT2 continued to show significant overtransmission of an allele to the fetus, with a final P value of .00003. There was an interaction between maternal smoking and fetal inheritance of a GSTT1-null deletion, seen in both the Danish (P=.03) and Iowan (P=.002) studies, with a Fisher's combined P value of <.001, which remained significant after correction for multiple comparisons. Gene-expression analysis demonstrated expression of GSTT1 in human embryonic craniofacial tissues during the relevant developmental interval. This study benefited from two large samples, involving independent populations, that provided substantial power and a framework for future studies that could identify a susceptible population for preventive health care. [Abstract/Link to Full Text]

Liquori CL, Berg MJ, Squitieri F, Leedom TP, Ptacek L, Johnson EW, Marchuk DA
Deletions in CCM2 are a common cause of cerebral cavernous malformations.
Am J Hum Genet. 2007 Jan;80(1):69-75.
Cerebral cavernous malformations (CCMs) are vascular abnormalities of the brain that can result in a variety of neurological disabilities, including hemorrhagic stroke and seizures. Mutations in the gene KRIT1 are responsible for CCM1, mutations in the gene MGC4607 are responsible for CCM2, and mutations in the gene PDCD10 are responsible for CCM3. DNA sequence analysis of the known CCM genes in a cohort of 63 CCM-affected families showed that a high proportion (40%) of these lacked any identifiable mutation. We used multiplex ligation-dependent probe analysis to screen 25 CCM1, -2, and -3 mutation-negative probands for potential deletions or duplications within all three CCM genes. We identified a total of 15 deletions: 1 in the CCM1 gene, 0 in the CCM3 gene, and 14 in the CCM2 gene. In our cohort, mutation screening that included sequence and deletion analyses gave disease-gene frequencies of 40% for CCM1, 38% for CCM2, 6% for CCM3, and 16% with no mutation detected. These data indicate that the prevalence of CCM2 is much higher than previously predicted, nearly equal to CCM1, and that large genomic deletions in the CCM2 gene represent a major component of this disease. A common 77.6-kb deletion spanning CCM2 exons 2-10 was identified, which is present in 13% of our entire CCM cohort. Eight probands exhibit an apparently identical recombination event in the CCM2 gene, involving an AluSx in intron 1 and an AluSg distal to exon 10. Haplotype analysis revealed that this CCM2 deletion occurred independently at least twice in our families. We hypothesize that these deletions occur in a hypermutable region because of surrounding repetitive sequence elements that may catalyze the formation of intragenic deletions. [Abstract/Link to Full Text]

Chung RH, Morris RW, Zhang L, Li YJ, Martin ER
X-APL: an improved family-based test of association in the presence of linkage for the X chromosome.
Am J Hum Genet. 2007 Jan;80(1):59-68.
Family-based association methods have been developed primarily for autosomal markers. The X-linked sibling transmission/disequilibrium test (XS-TDT) and the reconstruction-combined TDT for X-chromosome markers (XRC-TDT) are the first association-based methods for testing markers on the X chromosome in family data sets. These are valid tests of association in family triads or discordant sib pairs but are not theoretically valid in multiplex families when linkage is present. Recently, XPDT and XMCPDT, modified versions of the pedigree disequilibrium test (PDT), were proposed. Like the PDT, XPDT compares genotype transmissions from parents to affected offspring or genotypes of discordant siblings; however, the XPDT can have low power if there are many missing parental genotypes. XMCPDT uses a Monte Carlo sampling approach to infer missing parental genotypes on the basis of true or estimated population allele frequencies. Although the XMCPDT was shown to be more powerful than the XPDT, variability in the statistic due to the use of an estimate of allele frequency is not properly accounted for. Here, we present a novel family-based test of association, X-APL, a modification of the test for association in the presence of linkage (APL) test. Like the APL, X-APL can use singleton or multiplex families and properly infers missing parental genotypes in linkage regions by considering identity-by-descent parameters for affected siblings. Sampling variability of parameter estimates is accounted for through a bootstrap procedure. X-APL can test individual marker loci or X-chromosome haplotypes. To allow for different penetrances in males and females, separate sex-specific tests are provided. Using simulated data, we demonstrated validity and showed that the X-APL is more powerful than alternative tests. To show its utility and to discuss interpretation in real-data analysis, we also applied the X-APL to candidate-gene data in a sample of families with Parkinson disease. [Abstract/Link to Full Text]

Valente L, Tiranti V, Marsano RM, Malfatti E, Fernandez-Vizarra E, Donnini C, Mereghetti P, De Gioia L, Burlina A, Castellan C, Comi GP, Savasta S, Ferrero I, Zeviani M
Infantile encephalopathy and defective mitochondrial DNA translation in patients with mutations of mitochondrial elongation factors EFG1 and EFTu.
Am J Hum Genet. 2007 Jan;80(1):44-58.
Mitochondrial protein translation is a complex process performed within mitochondria by an apparatus composed of mitochondrial DNA (mtDNA)-encoded RNAs and nuclear DNA-encoded proteins. Although the latter by far outnumber the former, the vast majority of mitochondrial translation defects in humans have been associated with mutations in RNA-encoding mtDNA genes, whereas mutations in protein-encoding nuclear genes have been identified in a handful of cases. Genetic investigation involving patients with defective mitochondrial translation led us to the discovery of novel mutations in the mitochondrial elongation factor G1 (EFG1) in one affected baby and, for the first time, in the mitochondrial elongation factor Tu (EFTu) in another one. Both patients were affected by severe lactic acidosis and rapidly progressive, fatal encephalopathy. The EFG1-mutant patient had early-onset Leigh syndrome, whereas the EFTu-mutant patient had severe infantile macrocystic leukodystrophy with micropolygyria. Structural modeling enabled us to make predictions about the effects of the mutations at the molecular level. Yeast and mammalian cell systems proved the pathogenic role of the mutant alleles by functional complementation in vivo. Nuclear-gene abnormalities causing mitochondrial translation defects represent a new, potentially broad field of mitochondrial medicine. Investigation of these defects is important to expand the molecular characterization of mitochondrial disorders and also may contribute to the elucidation of the complex control mechanisms, which regulate this fundamental pathway of mtDNA homeostasis. [Abstract/Link to Full Text]

Hill C, Soares P, Mormina M, Macaulay V, Clarke D, Blumbach PB, Vizuete-Forster M, Forster P, Bulbeck D, Oppenheimer S, Richards M
A mitochondrial stratigraphy for island southeast Asia.
Am J Hum Genet. 2007 Jan;80(1):29-43.
Island Southeast Asia (ISEA) was first colonized by modern humans at least 45,000 years ago, but the extent to which the modern inhabitants trace their ancestry to the first settlers is a matter of debate. It is widely held, in both archaeology and linguistics, that they are largely descended from a second wave of dispersal, proto-Austronesian-speaking agriculturalists who originated in China and spread to Taiwan approximately 5,500 years ago. From there, they are thought to have dispersed into ISEA approximately 4,000 years ago, assimilating the indigenous populations. Here, we demonstrate that mitochondrial DNA diversity in the region is extremely high and includes a large number of indigenous clades. Only a fraction of these date back to the time of first settlement, and the majority appear to mark dispersals in the late-Pleistocene or early-Holocene epoch most likely triggered by postglacial flooding. There are much closer genetic links to Taiwan than to the mainland, but most of these probably predated the mid-Holocene "Out of Taiwan" event as traditionally envisioned. Only approximately 20% at most of modern mitochondrial DNAs in ISEA could be linked to such an event, suggesting that, if an agriculturalist migration did take place, it was demographically minor, at least with regard to the involvement of women. [Abstract/Link to Full Text]

Morton N, Maniatis N, Zhang W, Ennis S, Collins A
Genome scanning by composite likelihood.
Am J Hum Genet. 2007 Jan;80(1):19-28.
Ambitious programs have recently been advocated or launched to create genomewide databases for meta-analysis of association between DNA markers and phenotypes of medical and/or social concern. A necessary but not sufficient condition for success in association mapping is that the data give accurate estimates of both genomic location and its standard error, which are provided for multifactorial phenotypes by composite likelihood. That class includes the Malecot model, which we here apply with an illustrative example. This preliminary analysis leads to five inferences: permutation of cases and controls provides a test of association free of autocorrelation; two hypotheses give similar estimates, but one is consistently more accurate; estimation of the false-discovery rate is extended to causal genes in a small proportion of regions; the minimal data for successful meta-analysis are inferred; and power is robust for all genomic factors except minor-allele frequency. An extension to meta-analysis is proposed. Other approaches to genome scanning and meta-analysis should, if possible, be similarly extended so that their operating characteristics can be compared. [Abstract/Link to Full Text]

Zhao X, Tang R, Gao B, Shi Y, Zhou J, Guo S, Zhang J, Wang Y, Tang W, Meng J, Li S, Wang H, Ma G, Lin C, Xiao Y, Feng G, Lin Z, Zhu S, Xing Y, Sang H, St Clair D, He L
Functional variants in the promoter region of Chitinase 3-like 1 (CHI3L1) and susceptibility to schizophrenia.
Am J Hum Genet. 2007 Jan;80(1):12-8.
The chitinase 3-like 1 gene (CHI3L1) is abnormally expressed in the hippocampus of subjects with schizophrenia and may be involved in the cellular response to various environmental events that are reported to increase the risk of schizophrenia. Here, we provide evidence that the functional variants at the CHI3L1 locus influence the genetic risk of schizophrenia. First, using case-control and transmission/disequilibrium-test (TDT) methodologies, we detected a significant association between schizophrenia and haplotypes within the promoter region of CHI3L1 in two independent cohorts of Chinese individuals. Second, the at-risk CCC haplotype (P=.00058 and .0018 in case-control and TDT studies, respectively) revealed lower transcriptional activity (P=2.2 x 10(-7)) and was associated with lower expression (P=3.1 x 10(-5)) compared with neutral and protective haplotypes. Third, we found that an allele of SNP4 (rs4950928), the tagging SNP of CCC, impaired the MYC/MAX-regulated transcriptional activation of CHI3L1 by altering the transcriptional-factor consensus sequences, and this may be responsible for the decreased expression of the CCC haplotype. In contrast, the protective TTG haplotype was associated with a high level of CHI3L1 expression. Our findings identify CHI3L1 as a potential schizophrenia-susceptibility gene and suggest that the genes involved in the biological response to adverse environmental conditions are likely to play roles in the predisposition to schizophrenia. [Abstract/Link to Full Text]

Stoetzel C, Muller J, Laurier V, Davis EE, Zaghloul NA, Vicaire S, Jacquelin C, Plewniak F, Leitch CC, Sarda P, Hamel C, de Ravel TJ, Lewis RA, Friederich E, Thibault C, Danse JM, Verloes A, Bonneau D, Katsanis N, Poch O, Mandel JL, Dollfus H
Identification of a novel BBS gene (BBS12) highlights the major role of a vertebrate-specific branch of chaperonin-related proteins in Bardet-Biedl syndrome.
Am J Hum Genet. 2007 Jan;80(1):1-11.
Bardet-Biedl syndrome (BBS) is primarily an autosomal recessive ciliopathy characterized by progressive retinal degeneration, obesity, cognitive impairment, polydactyly, and kidney anomalies. The disorder is genetically heterogeneous, with 11 BBS genes identified to date, which account for ~70% of affected families. We have combined single-nucleotide-polymorphism array homozygosity mapping with in silico analysis to identify a new BBS gene, BBS12. Patients from two Gypsy families were homozygous and haploidentical in a 6-Mb region of chromosome 4q27. FLJ35630 was selected as a candidate gene, because it was predicted to encode a protein with similarity to members of the type II chaperonin superfamily, which includes BBS6 and BBS10. We found pathogenic mutations in both Gypsy families, as well as in 14 other families of various ethnic backgrounds, indicating that BBS12 accounts for approximately 5% of all BBS cases. BBS12 is vertebrate specific and, together with BBS6 and BBS10, defines a novel branch of the type II chaperonin superfamily. These three genes are characterized by unusually rapid evolution and are likely to perform ciliary functions specific to vertebrates that are important in the pathophysiology of the syndrome, and together they account for about one-third of the total BBS mutational load. Consistent with this notion, suppression of each family member in zebrafish yielded gastrulation-movement defects characteristic of other BBS morphants, whereas simultaneous suppression of all three members resulted in severely affected embryos, possibly hinting at partial functional redundancy within this protein family. [Abstract/Link to Full Text]

Syrris P, Ward D, Evans A, Asimaki A, Gandjbakhch E, Sen-Chowdhry S, McKenna WJ
Arrhythmogenic right ventricular dysplasia/cardiomyopathy associated with mutations in the desmosomal gene desmocollin-2.
Am J Hum Genet. 2006 Nov;79(5):978-84.
Arrhythmogenic right ventricular dysplasia/cardiomyopathy (ARVD/C) is an inherited myocardial disorder associated with arrhythmias, heart failure, and sudden death. To date, mutations in four genes encoding major desmosomal proteins (plakoglobin, desmoplakin, plakophilin-2, and desmoglein-2) have been implicated in the pathogenesis of ARVD/C. We screened 77 probands with ARVD/C for mutations in desmocollin-2 (DSC2), a gene coding for a desmosomal cadherin. Two heterozygous mutations--a deletion and an insertion--were identified in four probands. Both mutations result in frameshifts and premature truncation of the desmocollin-2 protein. For the first time, we have identified mutations in desmocollin-2 in patients with ARVD/C, a finding that is consistent with the hypothesis that ARVD/C is a disease of the desmosome. [Abstract/Link to Full Text]

Wycisk KA, Zeitz C, Feil S, Wittmer M, Forster U, Neidhardt J, Wissinger B, Zrenner E, Wilke R, Kohl S, Berger W
Mutation in the auxiliary calcium-channel subunit CACNA2D4 causes autosomal recessive cone dystrophy.
Am J Hum Genet. 2006 Nov;79(5):973-7.
Retinal signal transmission depends on the activity of high voltage-gated l-type calcium channels in photoreceptor ribbon synapses. We recently identified a truncating frameshift mutation in the Cacna2d4 gene in a spontaneous mouse mutant with profound loss of retinal signaling and an abnormal morphology of ribbon synapses in rods and cones. The Cacna2d4 gene encodes an l-type calcium-channel auxiliary subunit of the alpha (2) delta type. Mutations in its human orthologue, CACNA2D4, were not yet known to be associated with a disease. We performed mutation analyses of 34 patients who received an initial diagnosis of night blindness, and, in two affected siblings, we detected a homozygous nucleotide substitution (c.2406C-->A) in CACNA2D4. The mutation introduces a premature stop codon that truncates one-third of the corresponding open reading frame. Both patients share symptoms of slowly progressing cone dystrophy. These findings represent the first report of a mutation in the human CACNA2D4 gene and define a novel gene defect that causes autosomal recessive cone dystrophy. [Abstract/Link to Full Text]

Feuk L, Kalervo A, Lipsanen-Nyman M, Skaug J, Nakabayashi K, Finucane B, Hartung D, Innes M, Kerem B, Nowaczyk MJ, Rivlin J, Roberts W, Senman L, Summers A, Szatmari P, Wong V, Vincent JB, Zeesman S, Osborne LR, Cardy JO, Kere J, Scherer SW, Hannula-Jouppi K
Absence of a paternally inherited FOXP2 gene in developmental verbal dyspraxia.
Am J Hum Genet. 2006 Nov;79(5):965-72.
Mutations in FOXP2 cause developmental verbal dyspraxia (DVD), but only a few cases have been described. We characterize 13 patients with DVD--5 with hemizygous paternal deletions spanning the FOXP2 gene, 1 with a translocation interrupting FOXP2, and the remaining 7 with maternal uniparental disomy of chromosome 7 (UPD7), who were also given a diagnosis of Silver-Russell Syndrome (SRS). Of these individuals with DVD, all 12 for whom parental DNA was available showed absence of a paternal copy of FOXP2. Five other individuals with deletions of paternally inherited FOXP2 but with incomplete clinical information or phenotypes too complex to properly assess are also described. Four of the patients with DVD also meet criteria for autism spectrum disorder. Individuals with paternal UPD7 or with partial maternal UPD7 or deletion starting downstream of FOXP2 do not have DVD. Using quantitative real-time polymerase chain reaction, we show the maternally inherited FOXP2 to be comparatively underexpressed. Our results indicate that absence of paternal FOXP2 is the cause of DVD in patients with SRS with maternal UPD7. The data also point to a role for differential parent-of-origin expression of FOXP2 in human speech development. [Abstract/Link to Full Text]

Spencer DH, Bubb KL, Olson MV
Detecting disease-causing mutations in the human genome by haplotype matching.
Am J Hum Genet. 2006 Nov;79(5):958-64.
Comparisons between haplotypes from affected patients and the human reference genome are frequently used to identify candidates for disease-causing mutations, even though these alignments are expected to reveal a high level of background neutral polymorphism. This limits the scope of genetic studies to relatively small genomic intervals, because current methods for distinguishing potential causal mutations from neutral variation are inefficient. Here we describe a new strategy for detecting mutations that is based on comparing affected haplotypes with closely matched control sequences from healthy individuals, rather than with the human reference genome. We use theory, simulation, and a real data set to show that this approach is expected to reduce the number of sequence variants that must be subjected to follow-up analysis by at least a factor of 20 when closely matched control sequences are selected from a reference panel with as few as 100 control genomes. We also define a reference data resource that would allow efficient application of this strategy to large critical intervals across the genome. [Abstract/Link to Full Text]

Konrad M, Schaller A, Seelow D, Pandey AV, Waldegger S, Lesslauer A, Vitzthum H, Suzuki Y, Luk JM, Becker C, Schlingmann KP, Schmid M, Rodriguez-Soriano J, Ariceta G, Cano F, Enriquez R, Juppner H, Bakkaloglu SA, Hediger MA, Gallati S, Neuhauss SC, Nurnberg P, Weber S
Mutations in the tight-junction gene claudin 19 (CLDN19) are associated with renal magnesium wasting, renal failure, and severe ocular involvement.
Am J Hum Genet. 2006 Nov;79(5):949-57.
Claudins are major components of tight junctions and contribute to the epithelial-barrier function by restricting free diffusion of solutes through the paracellular pathway. We have mapped a new locus for recessive renal magnesium loss on chromosome 1p34.2 and have identified mutations in CLDN19, a member of the claudin multigene family, in patients affected by hypomagnesemia, renal failure, and severe ocular abnormalities. CLDN19 encodes the tight-junction protein claudin-19, and we demonstrate high expression of CLDN19 in renal tubules and the retina. The identified mutations interfere severely with either cell-membrane trafficking or the assembly of the claudin-19 protein. The identification of CLDN19 mutations in patients with chronic renal failure and severe visual impairment supports the fundamental role of claudin-19 for normal renal tubular function and undisturbed organization and development of the retina. [Abstract/Link to Full Text]

Khateeb S, Flusser H, Ofir R, Shelef I, Narkis G, Vardi G, Shorer Z, Levy R, Galil A, Elbedour K, Birk OS
PLA2G6 mutation underlies infantile neuroaxonal dystrophy.
Am J Hum Genet. 2006 Nov;79(5):942-8.
Infantile neuroaxonal dystrophy (INAD) is an autosomal recessive progressive neurodegenerative disease that presents within the first 2 years of life and culminates in death by age 10 years. Affected individuals from two unrelated Bedouin Israeli kindreds were studied. Brain imaging demonstrated diffuse cerebellar atrophy and abnormal iron deposition in the medial and lateral globus pallidum. Progressive white-matter disease and reduction of the N-acetyl aspartate : chromium ratio were evident on magnetic resonance spectroscopy, suggesting loss of myelination. The clinical and radiological diagnosis of INAD was verified by sural nerve biopsy. The disease gene was mapped to a 1.17-Mb locus on chromosome 22q13.1 (LOD score 4.7 at recombination fraction 0 for SNP rs139897), and an underlying mutation common to both affected families was identified in PLA2G6, the gene encoding phospholipase A2 group VI (cytosolic, calcium-independent). These findings highlight a role of phospholipase in neurodegenerative disorders. [Abstract/Link to Full Text]

Toydemir RM, Brassington AE, Bayrak-Toydemir P, Krakowiak PA, Jorde LB, Whitby FG, Longo N, Viskochil DH, Carey JC, Bamshad MJ
A novel mutation in FGFR3 causes camptodactyly, tall stature, and hearing loss (CATSHL) syndrome.
Am J Hum Genet. 2006 Nov;79(5):935-41.
Activating mutations of FGFR3, a negative regulator of bone growth, are well known to cause a variety of short-limbed bone dysplasias and craniosynostosis syndromes. We mapped the locus causing a novel disorder characterized by camptodactyly, tall stature, scoliosis, and hearing loss (CATSHL syndrome) to chromosome 4p. Because this syndrome recapitulated the phenotype of the Fgfr3 knockout mouse, we screened FGFR3 and subsequently identified a heterozygous missense mutation that is predicted to cause a p.R621H substitution in the tyrosine kinase domain and partial loss of FGFR3 function. These findings indicate that abnormal FGFR3 signaling can cause human anomalies by promoting as well as inhibiting endochondral bone growth. [Abstract/Link to Full Text]

Pezzolesi MG, Li Y, Zhou XP, Pilarski R, Shen L, Eng C
Mutation-positive and mutation-negative patients with Cowden and Bannayan-Riley-Ruvalcaba syndromes associated with distinct 10q haplotypes.
Am J Hum Genet. 2006 Nov;79(5):923-34.
Phosphatase and tensin homolog deleted on chromosome 10 (PTEN) encodes a tumor-suppressor phosphatase frequently mutated in both sporadic and heritable forms of human cancer. Germline mutations are associated with a number of heritable cancer syndromes that are jointly referred to as the "PTEN hamartoma tumor syndrome" (PHTS) and include Cowden syndrome, Bannayan-Riley-Ruvalcaba syndrome, Proteus syndrome, and Proteus-like syndrome. Germline PTEN mutations have been identified in a significant proportion of patients with PHTS; however, there are still many individuals with classic diagnostic features for whom mutations have yet to be identified. To address this, we took a haplotype-based approach and investigated the association of specific genomic regions of the PTEN locus with PHTS. We found this locus to be characterized by three distinct haplotype blocks 33 kb, 65 kb, and 43 kb in length. Comparisons of the haplotype distributions for all three blocks differed significantly among patients with PHTS and controls (P=.0098, P<.0001, and P<.0001 for blocks 1, 2, and 3, respectively). "Rare" haplotype blocks and extended haplotypes account for two-to-threefold more PHTS chromosomes than control chromosomes. PTEN mutation-negative patients are strongly associated with a haplotype block spanning a region upstream of PTEN and the gene's first intron (P=.0027). Furthermore, allelic combinations contribute to the phenotypic complexity of this syndrome. Taken together, these data suggest that specific haplotypes and rare alleles underlie the disease etiology in these sample populations; constitute low-penetrance, modifying loci; and, specifically in the case of patients with PHTS for whom traditional mutations have yet to be identified, may harbor pathogenic variant(s) that have escaped detection by standard PTEN mutation-scanning methodologies. [Abstract/Link to Full Text]

Minichiello MJ, Durbin R
Mapping trait loci by use of inferred ancestral recombination graphs.
Am J Hum Genet. 2006 Nov;79(5):910-22.
Large-scale association studies are being undertaken with the hope of uncovering the genetic determinants of complex disease. We describe a computationally efficient method for inferring genealogies from population genotype data and show how these genealogies can be used to fine map disease loci and interpret association signals. These genealogies take the form of the ancestral recombination graph (ARG). The ARG defines a genealogical tree for each locus, and, as one moves along the chromosome, the topologies of consecutive trees shift according to the impact of historical recombination events. There are two stages to our analysis. First, we infer plausible ARGs, using a heuristic algorithm, which can handle unphased and missing data and is fast enough to be applied to large-scale studies. Second, we test the genealogical tree at each locus for a clustering of the disease cases beneath a branch, suggesting that a causative mutation occurred on that branch. Since the true ARG is unknown, we average this analysis over an ensemble of inferred ARGs. We have characterized the performance of our method across a wide range of simulated disease models. Compared with simpler tests, our method gives increased accuracy in positioning untyped causative loci and can also be used to estimate the frequencies of untyped causative alleles. We have applied our method to Ueda et al.'s association study of CTLA4 and Graves disease, showing how it can be used to dissect the association signal, giving potentially interesting results of allelic heterogeneity and interaction. Similar approaches analyzing an ensemble of ARGs inferred using our method may be applicable to many other problems of inference from population genotype data. [Abstract/Link to Full Text]

Mutsuddi M, Morris DW, Waggoner SG, Daly MJ, Scolnick EM, Sklar P
Analysis of high-resolution HapMap of DTNBP1 (Dysbindin) suggests no consistency between reported common variant associations and schizophrenia.
Am J Hum Genet. 2006 Nov;79(5):903-9.
DTNBP1 was first identified as a putative schizophrenia-susceptibility gene in Irish pedigrees, with a report of association to common genetic variation. Several replication studies have reported confirmation of an association to DTNBP1 in independent European samples; however, reported risk alleles and haplotypes appear to differ between studies, and comparison among studies has been confounded because different marker sets were employed by each group. To facilitate evaluation of existing evidence of association and further work, we supplemented the extensive genotype data, available through the International HapMap Project (HapMap), about DTNBP1 by specifically typing all associated single-nucleotide polymorphisms reported in each of the studies of the Centre d'Etude du Polymorphisme Humain (CEPH)-derived HapMap sample (CEU). Using this high-density reference map, we compared the putative disease-associated haplotype from each study and found that the association studies are inconsistent with regard to the identity of the disease-associated haplotype at DTNBP1. Specifically, all five "replication" studies define a positively associated haplotype that is different from the association originally reported. We further demonstrate that, in all six studies, the European-derived populations studied have haplotype patterns and frequencies that are consistent with HapMap CEU samples (and each other). Thus, it is unlikely that population differences are creating the inconsistency of the association studies. Evidence of association is, at present, equivocal and unsatisfactory. The new dense map of the region may be valuable in more-comprehensive follow-up studies. [Abstract/Link to Full Text]

Lindsay SJ, Khajavi M, Lupski JR, Hurles ME
A chromosomal rearrangement hotspot can be identified from population genetic variation and is coincident with a hotspot for allelic recombination.
Am J Hum Genet. 2006 Nov;79(5):890-902.
Insights into the origins of structural variation and the mutational mechanisms underlying genomic disorders would be greatly improved by a genomewide map of hotspots of nonallelic homologous recombination (NAHR). Moreover, our understanding of sequence variation within the duplicated sequences that are substrates for NAHR lags far behind that of sequence variation within the single-copy portion of the genome. Perhaps the best-characterized NAHR hotspot lies within the 24-kb-long Charcot-Marie-Tooth disease type 1A (CMT1A)-repeats (REPs) that sponsor deletions and duplications that cause peripheral neuropathies. We investigated structural and sequence diversity within the CMT1A-REPs, both within and between species. We discovered a high frequency of retroelement insertions, accelerated sequence evolution after duplication, extensive paralogous gene conversion, and a greater than twofold enrichment of SNPs in humans relative to the genome average. We identified an allelic recombination hotspot underlying the known NAHR hotspot, which suggests that the two processes are intimately related. Finally, we used our data to develop a novel method for inferring the location of an NAHR hotspot from sequence variation within segmental duplications and applied it to identify a putative NAHR hotspot within the LCR22 repeats that sponsor velocardiofacial syndrome deletions. We propose that a large-scale project to map sequence variation within segmental duplications would reveal a wealth of novel chromosomal-rearrangement hotspots. [Abstract/Link to Full Text]

Wimplinger I, Morleo M, Rosenberger G, Iaconis D, Orth U, Meinecke P, Lerer I, Ballabio A, Gal A, Franco B, Kutsche K
Mutations of the mitochondrial holocytochrome c-type synthase in X-linked dominant microphthalmia with linear skin defects syndrome.
Am J Hum Genet. 2006 Nov;79(5):878-89.
The microphthalmia with linear skin defects syndrome (MLS, or MIDAS) is an X-linked dominant male-lethal disorder almost invariably associated with segmental monosomy of the Xp22 region. In two female patients, from two families, with MLS and a normal karyotype, we identified heterozygous de novo point mutations--a missense mutation (p.R217C) and a nonsense mutation (p.R197X)--in the HCCS gene. HCCS encodes the mitochondrial holocytochrome c-type synthase that functions as heme lyase by covalently adding the prosthetic heme group to both apocytochrome c and c(1). We investigated a third family, displaying phenotypic variability, in which the mother and two of her daughters carry an 8.6-kb submicroscopic deletion encompassing part of the HCCS gene. Functional analysis demonstrates that both mutant proteins (R217C and Delta 197-268) were unable to complement a Saccharomyces cerevisiae mutant deficient for the HCCS orthologue Cyc3p, in contrast to wild-type HCCS. Moreover, ectopically expressed HCCS wild-type and the R217C mutant protein are targeted to mitochondria in CHO-K1 cells, whereas the C-terminal-truncated Delta 197-268 mutant failed to be sorted to mitochondria. Cytochrome c, the final product of holocytochrome c-type synthase activity, is implicated in both oxidative phosphorylation (OXPHOS) and apoptosis. We hypothesize that the inability of HCCS-deficient cells to undergo cytochrome c-mediated apoptosis may push cell death toward necrosis that gives rise to severe deterioration of the affected tissues. In summary, we suggest that disturbance of both OXPHOS and the balance between apoptosis and necrosis, as well as the X-inactivation pattern, may contribute to the variable phenotype observed in patients with MLS. [Abstract/Link to Full Text]

Smeitink JA, Elpeleg O, Antonicka H, Diepstra H, Saada A, Smits P, Sasarman F, Vriend G, Jacob-Hirsch J, Shaag A, Rechavi G, Welling B, Horst J, Rodenburg RJ, van den Heuvel B, Shoubridge EA
Distinct clinical phenotypes associated with a mutation in the mitochondrial translation elongation factor EFTs.
Am J Hum Genet. 2006 Nov;79(5):869-77.
The 13 polypeptides encoded in mitochondrial DNA (mtDNA) are synthesized in the mitochondrial matrix on a dedicated protein-translation apparatus that resembles that found in prokaryotes. Here, we have investigated the genetic basis for a mitochondrial protein-synthesis defect associated with a combined oxidative phosphorylation enzyme deficiency in two patients, one of whom presented with encephalomyopathy and the other with hypertrophic cardiomyopathy. Sequencing of candidate genes revealed the same homozygous mutation (C997T) in both patients in TSFM, a gene coding for the mitochondrial translation elongation factor EFTs. EFTs functions as a guanine nucleotide exchange factor for EFTu, another translation elongation factor that brings aminoacylated transfer RNAs to the ribosomal A site as a ternary complex with guanosine triphosphate. The mutation predicts an Arg333Trp substitution at an evolutionarily conserved site in a subdomain of EFTs that interacts with EFTu. Molecular modeling showed that the substitution disrupts local subdomain structure and the dimerization interface. The steady-state levels of EFTs and EFTu in patient fibroblasts were reduced by 75% and 60%, respectively, and the amounts of assembled complexes I, IV, and V were reduced by 35%-91% compared with the amounts in controls. These phenotypes and the translation defect were rescued by retroviral expression of either EFTs or EFTu. These data clearly establish mutant EFTs as the cause of disease in these patients. The fact that the same mutation is associated with distinct clinical phenotypes suggests the presence of genetic modifiers of the mitochondrial translation apparatus. [Abstract/Link to Full Text]

Zhou H, Brockington M, Jungbluth H, Monk D, Stanier P, Sewry CA, Moore GE, Muntoni F
Epigenetic allele silencing unveils recessive RYR1 mutations in core myopathies.
Am J Hum Genet. 2006 Nov;79(5):859-68.
Epigenetic regulation of gene expression is a source of genetic variation, which can mimic recessive mutations by creating transcriptional haploinsufficiency. Germline epimutations and genomic imprinting are typical examples, although their existence can be difficult to reveal. Genomic imprinting can be tissue specific, with biallelic expression in some tissues and monoallelic expression in others or with polymorphic expression in the general population. Mutations in the skeletal-muscle ryanodine-receptor gene (RYR1) are associated with malignant hyperthermia susceptibility and the congenital myopathies central core disease and multiminicore disease. RYR1 has never been thought to be affected by epigenetic regulation. However, during the RYR1-mutation analysis of a cohort of patients with recessive core myopathies, we discovered that 6 (55%) of 11 patients had monoallelic RYR1 transcription in skeletal muscle, despite being heterozygous at the genomic level. In families for which parental DNA was available, segregation studies showed that the nonexpressed allele was maternally inherited. Transcription analysis in patients' fibroblasts and lymphoblastoid cell lines indicated biallelic expression, which suggests tissue-specific silencing. Transcription analysis of normal human fetal tissues showed that RYR1 was monoallelically expressed in skeletal and smooth muscles, brain, and eye in 10% of cases. In contrast, 25 normal adult human skeletal-muscle samples displayed only biallelic expression. Finally, the administration of the DNA methyltransferase inhibitor 5-aza-deoxycytidine to cultured patient skeletal-muscle myoblasts reactivated the transcription of the silenced allele, which suggests hypermethylation as a mechanism for RYR1 silencing. Our data indicate that RYR1 undergoes polymorphic, tissue-specific, and developmentally regulated allele silencing and that this unveils recessive mutations in patients with core myopathies. Furthermore, our data suggest that imprinting is a likely mechanism for this phenomenon and that similar mechanisms could play a role in human phenotypic heterogeneity. [Abstract/Link to Full Text]

Wijsman EM, Rothstein JH, Thompson EA
Multipoint linkage analysis with many multiallelic or dense diallelic markers: Markov chain-Monte Carlo provides practical approaches for genome scans on general pedigrees.
Am J Hum Genet. 2006 Nov;79(5):846-58.
Computations for genome scans need to adapt to the increasing use of dense diallelic markers as well as of full-chromosome multipoint linkage analysis with either diallelic or multiallelic markers. Whereas suitable exact-computation tools are available for use with small pedigrees, equivalent exact computation for larger pedigrees remains infeasible. Markov chain-Monte Carlo (MCMC)-based methods currently provide the only computationally practical option. To date, no systematic comparison of the performance of MCMC-based programs is available, nor have these programs been systematically evaluated for use with dense diallelic markers. Using simulated data, we evaluate the performance of two MCMC-based linkage-analysis programs--lm_markers from the MORGAN package and SimWalk2--under a variety of analysis conditions. Pedigrees consisted of 14, 52, or 98 individuals in 3, 5, or 6 generations, respectively, with increasing amounts of missing data in larger pedigrees. One hundred replicates of markers and trait data were simulated on a 100-cM chromosome, with up to 10 multiallelic and up to 200 diallelic markers used simultaneously for computation of multipoint LOD scores. Exact computation was available for comparison in most situations, and comparison with a perfectly informative marker or interprogram comparison was available in the remaining situations. Our results confirm the accuracy of both programs in multipoint analysis with multiallelic markers on pedigrees of varied sizes and missing-data patterns, but there are some computational differences. In contrast, for large numbers of dense diallelic markers, only the lm_markers program was able to provide accurate results within a computationally practical time. Thus, programs in the MORGAN package are the first available to provide a computationally practical option for accurate linkage analyses in genome scans with both large numbers of diallelic markers and large pedigrees. [Abstract/Link to Full Text]

Zhao J, Jin L, Xiong M
Test for interaction between two unlinked loci.
Am J Hum Genet. 2006 Nov;79(5):831-45.
Despite the growing consensus on the importance of testing gene-gene interactions in genetic studies of complex diseases, the effect of gene-gene interactions has often been defined as a deviance from genetic additive effects, which is essentially treated as a residual term in genetic analysis and leads to low power in detecting the presence of interacting effects. To what extent the definition of gene-gene interaction at population level reflects the genes' biochemical or physiological interaction remains a mystery. In this article, we introduce a novel definition and a new measure of gene-gene interaction between two unlinked loci (or genes). We developed a general theory for studying linkage disequilibrium (LD) patterns in disease population under two-locus disease models. The properties of using the LD measure in a disease population as a function of the measure of gene-gene interaction between two unlinked loci were also investigated. We examined how interaction between two loci creates LD in a disease population and showed that the mathematical formulation of the new definition for gene-gene interaction between two loci was similar to that of the LD between two loci. This finding motived us to develop an LD-based statistic to detect gene-gene interaction between two unlinked loci. The null distribution and type I error rates of the LD-based statistic for testing gene-gene interaction were validated using extensive simulation studies. We found that the new test statistic was more powerful than the traditional logistic regression under three two-locus disease models and demonstrated that the power of the test statistic depends on the measure of gene-gene interaction. We also investigated the impact of using tagging SNPs for testing interaction on the power to detect interaction between two unlinked loci. Finally, to evaluate the performance of our new method, we applied the LD-based statistic to two published data sets. Our results showed that the P values of the LD-based statistic were smaller than those obtained by other approaches, including logistic regression models. [Abstract/Link to Full Text]

Gasper J, Swanson WJ
Molecular population genetics of the gene encoding the human fertilization protein zonadhesin reveals rapid adaptive evolution.
Am J Hum Genet. 2006 Nov;79(5):820-30.
A hallmark of positive selection (adaptive evolution) in protein-coding regions is a d(N)/d(S) ratio >1, where d(N) is the number of nonsynonymous substitutions/nonsynonymous sites and d(S) is the number of synonymous substitutions/synonymous sites. Zonadhesin is a male reproductive protein localized on the sperm head, comprising many domains known to be involved in cell-cell interaction or cell adhesion. Previous studies have shown that VWD domains (homologous to the D domains of the von Willebrand factor) are involved directly in binding to the female zona pellucida (ZP) in a species-specific manner. In this study, we sequenced 47 coding exons in 12 primate species and, by using maximum-likelihood methods to determine sites under positive selection, we show that VWD2, membrane/A5 antigen mu receptor, and mucin-like domains in zonadhesin are rapidly evolving and, thus, may be involved in binding to the ZP in a species-specific manner in primates. In addition, polymorphism data from 48 human individuals revealed significant polymorphism-to-divergence heterogeneity and a significant departure from equilibrium-neutral expectations in the frequency spectrum, suggesting balancing selection and positive selection occurring in zonadhesin (ZAN) within human populations. Finally, we observe adaptive evolution in haplotypes segregating for a frameshift mutation that was previously thought to indicate that ZAN was a potential pseudogene. [Abstract/Link to Full Text]

Hreb�cek M, Mr�zov� L, Seyrantepe V, Durand S, Roslin NM, Noskov� L, Hartmannov� H, Iv�nek R, C�zkova A, Poupetov� H, Sikora J, Urinovsk� J, Straneck� V, Zeman J, Lepage P, Roquis D, Verner A, Ausseil J, Beesley CE, Maire I, Poorthuis BJ, van de Kamp J, van Diggelen OP, Wevers RA, Hudson TJ, Fujiwara TM, Majewski J, Morgan K, Kmoch S, Pshezhetsky AV
Mutations in TMEM76* cause mucopolysaccharidosis IIIC (Sanfilippo C syndrome).
Am J Hum Genet. 2006 Nov;79(5):807-19.
Mucopolysaccharidosis IIIC (MPS IIIC, or Sanfilippo C syndrome) is a lysosomal storage disorder caused by the inherited deficiency of the lysosomal membrane enzyme acetyl-coenzyme A: alpha -glucosaminide N-acetyltransferase (N-acetyltransferase), which leads to impaired degradation of heparan sulfate. We report the narrowing of the candidate region to a 2.6-cM interval between D8S1051 and D8S1831 and the identification of the transmembrane protein 76 gene (TMEM76), which encodes a 73-kDa protein with predicted multiple transmembrane domains and glycosylation sites, as the gene that causes MPS IIIC when it is mutated. Four nonsense mutations, 3 frameshift mutations due to deletions or a duplication, 6 splice-site mutations, and 14 missense mutations were identified among 30 probands with MPS IIIC. Functional expression of human TMEM76 and the mouse ortholog demonstrates that it is the gene that encodes the lysosomal N-acetyltransferase and suggests that this enzyme belongs to a new structural class of proteins that transport the activated acetyl residues across the cell membrane. [Abstract/Link to Full Text]

Wessel J, Schork NJ
Generalized genomic distance-based regression methodology for multilocus association analysis.
Am J Hum Genet. 2006 Nov;79(5):792-806.
Large-scale, multilocus genetic association studies require powerful and appropriate statistical-analysis tools that are designed to relate genotype and haplotype information to phenotypes of interest. Many analysis approaches consider relating allelic, haplotypic, or genotypic information to a trait through use of extensions of traditional analysis techniques, such as contingency-table analysis, regression methods, and analysis-of-variance techniques. In this work, we consider a complementary approach that involves the characterization and measurement of the similarity and dissimilarity of the allelic composition of a set of individuals' diploid genomes at multiple loci in the regions of interest. We describe a regression method that can be used to relate variation in the measure of genomic dissimilarity (or "distance") among a set of individuals to variation in their trait values. Weighting factors associated with functional or evolutionary conservation information of the loci can be used in the assessment of similarity. The proposed method is very flexible and is easily extended to complex multilocus-analysis settings involving covariates. In addition, the proposed method actually encompasses both single-locus and haplotype-phylogeny analysis methods, which are two of the most widely used approaches in genetic association analysis. We showcase the method with data described in the literature. Ultimately, our method is appropriate for high-dimensional genomic data and anticipates an era when cost-effective exhaustive DNA sequence data can be obtained for a large number of individuals, over and above genotype information focused on a few well-chosen loci. [Abstract/Link to Full Text]

Mirault ME, Boucher P, Tremblay A
Nucleotide-resolution mapping of topoisomerase-mediated and apoptotic DNA strand scissions at or near an MLL translocation hotspot.
Am J Hum Genet. 2006 Nov;79(5):779-91.
The emergence of therapy-related acute myeloid leukemia (t-AML) has been associated with DNA topoisomerase II (TOP2)-targeted drug treatments and chromosomal translocations frequently involving the MLL, or ALL-1, gene. Two distinct mechanisms have been implicated as potential triggers of t-AML translocations: TOP2-mediated DNA cleavage and apoptotic higher-order chromatin fragmentation. Assessment of the role of TOP2 in this process has been hampered by a lack of techniques allowing in vivo mapping of TOP2-mediated DNA cleavage at nucleotide resolution in single-copy genes. A novel method, extension ligation-mediated polymerase chain reaction (ELMPCR), was used here for mapping topoisomerase-mediated DNA strand breaks and apoptotic DNA cleavage across a translocation-prone region of MLL in human cells. We report the first genomic map integrating translocation breakpoints and topoisomerase I, TOP2, and apoptotic DNA cleavage sites at nucleotide resolution across an MLL region harboring a t-AML translocation hotspot. This hotspot is flanked by a TOP2 cleavage site and is localized at one extremity of a minor apoptotic cleavage region, where multiple single- and double-strand breaks were induced by caspase-activated apoptotic nucleases. This cleavage pattern was in sharp contrast to that observed approximately 200 bp downstream in the exon 12 region, which displayed much stronger apoptotic cleavage but where no double-strand breaks were detected and no t-AML-associated breakpoints were reported. The localization and remarkable clustering of the t-AML breakpoints cannot be explained simply by the DNA cleavage patterns but might result from potential interactions between TOP2 poisoning, apoptotic DNA cleavage, and DNA repair attempts at specific sites of higher-order chromatin structure in apoptosis-evading cells. ELMPCR provides a new tool for investigating the role of DNA topoisomerases in fundamental genetic processes and translocations associated with cancer treatments involving topoisomerase-targeted drugs. [Abstract/Link to Full Text]