Jump to content

User:Esotericdreams/sandbox

From Wikipedia, the free encyclopedia

C7ORF61

[edit]
SPACDR
Identifiers
AliasesSPACDR, chromosome 7 open reading frame 61, sperm acrosome developmental regulator, C7orf61
External IDsHomoloGene: 52845; GeneCards: SPACDR; OMA:SPACDR - orthologs
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

NM_001004323

n/a

RefSeq (protein)

NP_001004323

n/a

Location (UCSC)Chr 7: 100.46 – 100.46 Mbn/a
PubMed search[2]n/a
Wikidata
View/Edit Human

Uncharacterized protein chromosome 7 open reading frame 61 is an asperigine-poor protein in humans encoded by the c7orf61 gene. The protein function is relatively unknown and is highly conserved in mammals.

Gene

[edit]

Locus

[edit]

C7orf61 is located on the reverse (or negative) DNA strand and is situated in chromosome 7 (7q22.1) from base pairs 100,456,615-100,464,271 - roughly 7,6547 bp. [3] It has a total of 3 exons and lacks isoforms.

mRNA

[edit]

The mRNA is approximately 1019 bp and belongs to domain of unknown function 4703 (PFAM15775).[4]

Protein

[edit]

In humans, the protein contains a total of 206 amino acids.[3] The protein's molecular weight is 23.71 kDa and it's isoelectric point is 10.41.[5] DUF4703 is positioned 22-206aa of the protein.Cite error: A <ref> tag is missing the closing </ref> (see the help page).

Composition

[edit]

The amino acid composition of C7orf61 comprises of high frequencies in leucine, serine, and charged valine.[6] The protein has an unusual low frequency in asperigine, making it an asperiginine-deficient protein [6], and contains higher frequences of salt-bridge formations between glutamic acid, aspartic acid, lysine, and arginine.[6]

Characteristics and Structure

[edit]

The consent within secondary structure prediction tools CFSSP[7], SSPRED[8], and GOR4 [9] is that the protein's secondary structure consist mainly of α-helices (51.2%), with significant amounts of coiling (38.2%) and smaller fragments of beta strands (10.3%).


Predicted 3D structure of C7orf61 protein via PHYRE2.[10]

Post-translational modifications

[edit]

C7orf61 has several post-translation modification sites, most of which involve serine/threonine kinases - protein kinase C, Casein kinase II, DNA-dependent protein kinase, and Cyclin-dependent kinase 1. It is predicted to contain a Biparte nuclear localization signal (NLS_BP), a leucine-rich variant domain (LRV), and bacterial Ig-like domain (BIG-1).[11]

Subcellular Localization

[edit]

C7orf61 does not contain any trans-membrane domains or signal peptides.[12][13] The protein is predicted to be localized in the Mitochondia, with little indication of extracellular activity. [14][15] Expanded analysis of amino acid sequence KFFRWVRRAWQRIISWVF near the N-terminal suggests the presence of a mitochondrial targeting signal. [16]

Gene Regulation

[edit]

C7orf61 has high levels of expression in the testis and lower levels in the brain and connective tissues.[17] Through the assessment of microarray experiments available on NCBI Geo, it's inferred that c7orf61 is under negative regulation. [18]

Homology

[edit]

C7orf61 does not have any paralogs. Analysis via NCBI tool BLASTt[19] found the gene to be highly conserved in mammals and could not be traced farther back than 160 MYA. The following table contains a list of orthologs found in several mammalian sub-classes - this is not a comprehensive list for the proteins orthology.

Species Common Name Divergence (MYA) Accession number (from NCBI [20]) Sequence Length Percent Identity Percent Similarity
Homo sapiens Human 0 NP_001004323.1 206 100% 100%
Ceratotherium simum White Rhinoceros 96 XP_014652622.1 204 64% 81%
Canis lupus Wolf 96 XP_008963687.1 203 64% 78%
Bos taurus Cow 96 XP_005225278.1 204 64% 78%
Physeter catodon Sperm Whale 96 XP_007110691.1 204 63% 76%
Condylura cristata Mole 96 XP_004691685.1 204 62% 75%
Eptesicus fuscus Brown bat 96 XP_008139625.1 213 61% 75%
Elephantulus edwardii Elephant shrew 105 XP_006896643.1 147 58% 78%
Phascolarctos cinereus Koala 159 XP_020834746.1 160 41% 58%
Sarcophilus harrisii Tasmanian Devil 160 XP_012404266.1 162 37% 61%

References

[edit]
  1. ^ a b c GRCh38: Ensembl release 89: ENSG00000185955Ensembl, May 2017
  2. ^ "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  3. ^ a b "NCBI h. sapiens Protein C7orf61". Retrieved 4 January 2018.
  4. ^ "NCBI h. sapiens C7orf61 mRNA". Retrieved 20 February 2018.
  5. ^ "ExPASy Compute pl/Mw tool". Retrieved 4 January 2018.
  6. ^ a b c "EMBL-EBI SAP". Retrieved 24 April 2018.
  7. ^ "CFSSP". Retrieved 23 April 2018.
  8. ^ "SoftBerry SSPRED". Retrieved 23 April 2018.
  9. ^ "GOR4". Retrieved 23 April 2018.
  10. ^ "PHYRE2". Retrieved 23 April 2018.
  11. ^ "SIB Motif Scan". Retrieved 23 April 2018.
  12. ^ "SignalP 4.1 Server prediction tool". Retrieved 20 April 2018.
  13. ^ "TMpred tool". Retrieved 21 April 2018.
  14. ^ "DeepLoc-1.0". Retrieved 24 April 2018.
  15. ^ "ProtComp 9.0". Retrieved 24 April 2018.
  16. ^ "Helical Wheel". Retrieved 25 April 2018.
  17. ^ "NCBI EST Profile - C7orf61". Retrieved 1 April 2018.
  18. ^ "NCBI Geo". Retrieved 2 April 2018.
  19. ^ "NCBI BLAST". Retrieved 4 February 2018.
  20. ^ "NCBI C7orf61". Retrieved 4 January 2018.