formation and expression ofpseudogenes
TRANSCRIPT
Introduction
The word ‘pseudogene’ was first used by Jacq et
al., 1977.
Pseudogenes are dysfunctional relatives of genes
that have lost their protein-coding ability or are
otherwise no longer expressed in the cell.
Charecteristic features,
• Highest homology to parental functional gene
• Presence of disablements to prevent its expression
4
Origin (formation) of Pseudogenes
Ondrage et al., 2011
1) Pseudogene formation after DNA- mediated
gene duplication
6
Processed ψn:
Lack the promoter so “dead on arrival”
Contains poly-A tail
Lack intron sequence
Random association with parent gene
9
Unprocessed ψn
Highest structural similarity with parent gene
Intact exons and introns
Prevalence of ORF disrupting mutations
Close association with parent gene
10
Unitary ψn
No gene duplication before pseudogenization
Rare in occurrence
Fixed disabling mutations
11
Detection of Pseudogenes
Homology-Based Approaches:
Harrison's Approach
(pseudogene annotation pipeline, 2001)
Sakai's Approach
PPFINDER
Pseudogene Finder (PSF)
PseudoPipe
13
Pseudogene databases
HoppsiGEN
PseudoGeneQuest
Pseudogene.Org
University of Iowa Pseudogene Resource
14
Identification of pseudogenes in the rice gene
complementNissen et al., 2009
1 Pseudogenes (with parent gene and at least one frameshift or premature stop codon)
2 GPFs not supported by cDNA or EST evidence
3 The UTRs of the GPFs are longer than mean + 2 standard deviations
4 The CDS of the GPFs are shorter than 50 amino acids
5 The GPFs contain a stretch of 18 adenines in a 20-base window, within -200 to 400 bases from the end of the
annotated UTR, or within 600 bases of the stop codon if no UTR is annotated
6 The GPFs have a significantly smaller number of exons
7 The GPFs contain a single exon and are within a segmentally duplicated region but have no paralog in the duplicated
region
15
Shared syntenic presentation of pseudogene
copies in four different Solanum species
Peter et al., 2011 20
Analysis of miRNA targeting the 3'-UTR of TUSC2 and pseudogene TUSC2P on chromosome Y and chromosome X.
Zina et al.,2014
24
Expression analysis of TUSC2P in cancer and non-cancer
cell lines by real-time PCR and reverse transcription–PCR
Zina et al.,2014
25
Zina et al.,2014
TUSC2P/TUSC2-UTR attracts endogenous miRNAs thus freeing TUSC2,
TIMP2 and TIMP3 mRNAs to be translated
28
Identifying small RNAs derived from gene-pseudogene pairs or
adjacent pseudogene-pseudogene pairs.
Guo et al., 2009
29
sequence comparison between rye B-located pseudogene-
like fragments (6 to 15) and their A-located parental
counterparts. (Banae et al., 2013)
34
Tissue- and Species-Specific Transcription of B-Located Pseudogene-Like
Fragments in Rye and Wheat.
(Banae et al., 2013)
Rye specific expression
B specific expression
Constitutive expression
Regulation of A
located gene
expression
35
35
Relationship between the numbers of ψn and annotated
functional genes in Pfam domain families.
Zou et al., 200939
Other utilities
Evolutionary related studies
Neutral substitution rate
Information on splice diversity of RNA
As record of fast gene expression fossils
40
40