CCDC94


YJU2
Identifiers
Aliases: YJU2, coiled-coil domain containing 94, CCDC94, YJU2 splicing factor homolog
External IDs: MGI: 1920136; HomoloGene: 6350; GeneCards: YJU2
Orthologs (human; mouse)
RefSeq (mRNA): NM_018074 (human); NM_028381 (mouse)
RefSeq (protein): NP_060544 (human); NP_082657 (mouse)
Location (UCSC): Chr 19: 4.25 – 4.27 Mb (human); Chr 17: 56.27 – 56.28 Mb (mouse)
PubMed search: [3] (human); [4] (mouse)

Coiled-coil domain containing 94 (CCDC94) is a protein that in humans is encoded by the CCDC94 gene.[5] The CCDC94 protein contains a coiled-coil domain, a domain of unknown function (DUF572), and an uncharacterized conserved protein domain (COG5134); it lacks a transmembrane domain.

Gene

Overview

Genomic location of CCDC94 at 19p13.3[6]

CCDC94 is a 21,975 base pair gene oriented on the plus strand of chromosome 19, spanning positions 4,247,111 to 4,269,085.[5] The gene product is a 1,441 base pair mRNA with 8 predicted exons in the human gene. Ensembl predicts one protein-coding alternative splice form.[7] This splice form contains 5 exons, 4 of which are coding. Promoter prediction and analysis were carried out using ElDorado.[8] The predicted promoter region spans 714 base pairs, from 4,246,532 to 4,247,245, on the plus strand of chromosome 19.
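
The lengths quoted above follow from inclusive coordinate arithmetic. The following is a minimal Python sketch that uses only the coordinates given in this section. It assumes the positions are 1-based and inclusive, and the helper names `span_length` and `overlap_length` are illustrative, not part of any annotation tool.

```python
def span_length(start: int, end: int) -> int:
    """Length in base pairs of a 1-based, inclusive coordinate range."""
    if end < start:
        raise ValueError("end must not precede start")
    return end - start + 1


def overlap_length(a: tuple[int, int], b: tuple[int, int]) -> int:
    """Number of positions shared by two inclusive ranges (0 if disjoint)."""
    start = max(a[0], b[0])
    end = min(a[1], b[1])
    return max(0, end - start + 1)


# Chromosome 19 coordinates as quoted in the text (assumed inclusive).
GENE = (4_247_111, 4_269_085)
PROMOTER = (4_246_532, 4_247_245)

gene_bp = span_length(*GENE)
promoter_bp = span_length(*PROMOTER)
shared_bp = overlap_length(GENE, PROMOTER)
print(gene_bp, promoter_bp, shared_bp)
```

Under these assumptions the sketch reproduces the 21,975 bp gene length and the 714 bp promoter length, and it shows that the predicted promoter extends 135 bp past the annotated start of the gene.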

Gene neighborhood

CCDC94 is located directly adjacent to the EBI3 gene (4,229,540-4,237,525) on the plus strand. The SH2 domain gene (4,278,598-4,290,720) also lies on the plus strand; because its coordinates are higher than those of CCDC94, it sits downstream of CCDC94.[9]
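
The spacing between CCDC94 and its neighbors can be worked out from the quoted coordinates. The following is a hedged Python sketch. It assumes 1-based inclusive positions, and `gap_between` is a hypothetical helper name. EBI3 sits on the lower-coordinate side of CCDC94 and the SH2 domain gene on the higher-coordinate side.

```python
def gap_between(left: tuple[int, int], right: tuple[int, int]) -> int:
    """Bases strictly between two non-overlapping inclusive ranges."""
    if right[0] <= left[1]:
        raise ValueError("ranges overlap or are out of order")
    return right[0] - left[1] - 1


# Chromosome 19 coordinates as quoted in the text, ordered by position.
EBI3 = (4_229_540, 4_237_525)
CCDC94 = (4_247_111, 4_269_085)
SH2_DOMAIN_GENE = (4_278_598, 4_290_720)

gap_ebi3 = gap_between(EBI3, CCDC94)
gap_sh2 = gap_between(CCDC94, SH2_DOMAIN_GENE)
print(f"EBI3 to CCDC94: {gap_ebi3} bp")
print(f"CCDC94 to SH2 domain gene: {gap_sh2} bp")
```

With these inputs both intergenic gaps come out at roughly 9.5 kb (9,585 bp and 9,512 bp).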

Gene expression

CCDC94 is expressed at low to moderate levels in most tissues of the body. Slightly elevated expression is seen in the thyroid, lung, dendritic cells, and lymphoblasts. Expression data are available from BioGPS[10] and from the NCBI GEO database.[11]

CCDC94 GEO profile expression in normal tissues.[11]

Protein

Properties and characteristics

CCDC94 belongs to the CWC16 family,[12] and its function is not well understood. The human form has 323 amino acid residues, an isoelectric point of 5.618, and a molecular mass of 37,086 daltons. There are no predicted transmembrane domains.[13] The one alternative splice form of CCDC94 encodes a protein of 161 amino acids.[14] The DUF572 and COG5134 domains are located at residues 1–319 and 7–108, respectively.[15] The coiled-coil region is located at residues 105–206.[16] The intracellular localization of CCDC94 has not yet been determined experimentally, but bioinformatic analysis using PSORT strongly suggests that CCDC94 resides in the nucleus, because the sequence contains nuclear localization signals.[17]

The CCDC94 protein construct, including the COG5134, DUF572, and coiled-coil domains.
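
The domain layout can be summarized numerically from the residue ranges quoted above. The sketch below is illustrative only: it treats the ranges as inclusive, and the names `DOMAINS`, `residues`, and `shared` are assumptions introduced for this example.

```python
PROTEIN_LENGTH = 323  # residues in human CCDC94, as quoted in the text

# Inclusive residue ranges as quoted in the text.
DOMAINS = {
    "DUF572": (1, 319),
    "COG5134": (7, 108),
    "coiled-coil": (105, 206),
}


def residues(start: int, end: int) -> int:
    """Number of residues in an inclusive range."""
    return end - start + 1


def shared(a: tuple[int, int], b: tuple[int, int]) -> int:
    """Number of residues common to two inclusive ranges (0 if disjoint)."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


cog_cc_overlap = shared(DOMAINS["COG5134"], DOMAINS["coiled-coil"])

for name, (start, end) in DOMAINS.items():
    n = residues(start, end)
    print(f"{name}: {n} residues, {n / PROTEIN_LENGTH:.1%} of the protein")
print(f"COG5134 / coiled-coil overlap: {cog_cc_overlap} residues")
```

Read this way, DUF572 covers nearly the whole protein, while the COG5134 and coiled-coil annotations are each about 100 residues long and overlap by only a few residues.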

Protein interactions

Protein interaction analysis for CCDC94 has been carried out using computational tools. No interactions were identified through the MINT database.[18] In STRING, the highest-scoring interactions for CCDC94 are with CDC5L, PLRG1, and PRPF19, based on an anti-tag coimmunoprecipitation assay.[19] Six additional interacting proteins were found. Closer analysis shows very little potential for these interactions to be real, so none should be considered actual protein-protein interactions. The protein interaction network from the STRING analysis is shown.


Transcription factors

The CCDC94 promoter region contains transcription factor binding sites. Notable transcription factors predicted by the ElDorado program from Genomatix include:[20]

  • Myeloid zinc finger protein (MZF1)
  • Forkhead box H1 (Foxh1)
  • Polyomavirus enhancer A binding protein 3 (ETV4)
  • E2F-myc activator/cell cycle regulator (E2F)
  • SPI-1 proto-oncogene; hematopoietic transcription factor (PU1)

Post-translational modifications

Bioinformatic analysis of CCDC94 using NetPhos[21] predicted 7 phosphorylation sites at serine residues, 3 at threonine residues, and 3 at tyrosine residues. Two of the threonine residues and all of the tyrosine residues predicted to be phosphorylated are highly conserved, as shown by their occurrence at the same positions in several analyzed orthologs. The high-scoring predicted phosphotyrosines lie in the N-terminal half of CCDC94, while the predicted phosphoserines lie in the C-terminal half. Sulfinator predicted only one tyrosine sulfation site, at amino acid 98.[22] SUMOplot predicted highly probable sumoylation sites at residues 90, 24, and 270.[23]

Tertiary structure

The tertiary structure of CCDC94 is predicted to contain several beta-sheet regions and only one confidently predicted alpha-helix region. PHYRE2 modeled 65 residues of CCDC94, 20% of the entire amino acid sequence, with 87.9% confidence.[24]

CCDC94 tertiary structure as predicted by PHYRE2.[24]

Homology

Orthologs

CCDC94 is very well conserved in many species, and the entire protein is conserved throughout all of its orthologs.[25] However, conservation does not extend as far back as bacteria. A phylogenetic tree generated with Biology WorkBench[26] shows the evolutionary relationships between Homo sapiens CCDC94 and its orthologs. The table below shows CCDC94 conservation among orthologs:

Genus and species | Common name | Divergence from humans (MYA)[27] | NCBI protein accession | Sequence similarity[25] | Protein length (amino acids)
Pan paniscus | Bonobo | 6.3 | XP_003819321.1 | 99% | 323
Gorilla gorilla gorilla | Gorilla | 8.8 | XP_004059817.1 | 98% | 286
Callithrix jacchus | Common marmoset | 42.6 | XP_002761642.1 | 83% | 278
Mus musculus | Mouse | 92.3 | NP_082657.1 | 87% | 314
Rattus norvegicus | Rat | 92.4 | NP_001103143.1 | 87% | 313
Cricetulus griseus | Chinese hamster | 92.4 | XP_003501789.1 | 85% | 321
Bos taurus | Cow | 94.4 | NP_001069159.1 | 89% | 320
Felis catus | Cat | 94.4 | XP_003981794.1 | 73% | 363
Sarcophilus harrisii | Tasmanian devil | 163.9 | XP_003760628.1 | 78% | 326
Monodelphis domestica | Opossum | 163.9 | XP_001374444.1 | 86% | 326
Gallus gallus | Red junglefowl | 296.4 | XP_423475.3 | 84% | 291
Anolis carolinensis | Lizard | 324.5 | XP_003230268.1 | 72% | 311
Xenopus tropicalis | Western clawed frog | 342.7 | NP_001017176.1 | 73% | 345
Xenopus laevis | African clawed frog | 371.2 | NP_001087648.1 | 83% | 280
Takifugu rubripes | Puffer fish | 454.6 | XP_003962830.1 | 64% | 348
Acyrthosiphon pisum | Pea aphid (insect) | 910 | NP_001155925.1 | 49% | 278
Harpegnathos saltator | Ant | 910 | EFN80619.1 | 47% | 351
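
One way to read the table is to compare average sequence similarity for closely and distantly related orthologs. The Python sketch below uses only the divergence times and similarity values listed above. The 100 MYA cutoff is an arbitrary choice made for illustration, and `ORTHOLOGS` and `mean_similarity` are hypothetical names.

```python
# (common name, divergence from humans in MYA, sequence similarity in %)
ORTHOLOGS = [
    ("Bonobo", 6.3, 99), ("Gorilla", 8.8, 98), ("Common marmoset", 42.6, 83),
    ("Mouse", 92.3, 87), ("Rat", 92.4, 87), ("Chinese hamster", 92.4, 85),
    ("Cow", 94.4, 89), ("Cat", 94.4, 73), ("Tasmanian devil", 163.9, 78),
    ("Opossum", 163.9, 86), ("Red junglefowl", 296.4, 84),
    ("Lizard", 324.5, 72), ("Western clawed frog", 342.7, 73),
    ("African clawed frog", 371.2, 83), ("Puffer fish", 454.6, 64),
    ("Pea aphid", 910, 49), ("Ant", 910, 47),
]

CUTOFF_MYA = 100  # arbitrary illustrative threshold, not from the source


def mean_similarity(rows):
    """Arithmetic mean of the similarity column for the given rows."""
    values = [similarity for _, _, similarity in rows]
    return sum(values) / len(values)


recent = [row for row in ORTHOLOGS if row[1] < CUTOFF_MYA]
distant = [row for row in ORTHOLOGS if row[1] >= CUTOFF_MYA]

mean_recent = mean_similarity(recent)
mean_distant = mean_similarity(distant)
print(f"< {CUTOFF_MYA} MYA: {len(recent)} orthologs, mean {mean_recent:.1f}%")
print(f">= {CUTOFF_MYA} MYA: {len(distant)} orthologs, mean {mean_distant:.1f}%")
```

With this split, the orthologs that diverged more recently average about 88% similarity and the more distant ones about 71%, which matches the general trend in the table. Individual rows such as the cat (73%) still depart from it.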

Paralogs

CCDC94 has only one paralog, CCDC130 (also known as MGC10471).[28] CCDC130 is very similar to CCDC94, as it contains both the DUF572 and COG5134 domains.[29]

References

  1. GRCh38: Ensembl release 89: ENSG00000105248. Ensembl, May 2017.
  2. GRCm38: Ensembl release 89: ENSMUSG00000003208. Ensembl, May 2017.
  3. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  5. "Coiled-coil domain-containing 94 Homo sapiens". NCBI. Retrieved May 10, 2013.
  6. "Coiled-coil domain-containing 94". GeneCards. Retrieved May 12, 2013.
  7. "Transcript variants". Ensembl. Retrieved May 10, 2013.
  8. "ElDorado:Genomes and Annotation". Genomatix. Archived from the original on May 22, 2021. Retrieved May 11, 2013.
  9. "Coiled-coil domain-containing 94 Homo sapiens". NCBI. Retrieved May 11, 2013.
  10. "Tissue-specific mRNA expression". BioGPS. Retrieved May 11, 2013.
  11. "CCDC94:Multiple Normal Tissues". NCBI. Retrieved May 12, 2013.
  12. "GeneCards:CCDC94". GeneCards. Retrieved May 10, 2013.
  13. "Biology WorkBench SAPS Program". Biology WorkBench. Retrieved May 11, 2013.
  14. "Transcript: CCDC94". Ensembl. Retrieved May 11, 2013.
  15. "Coiled-coil domain-containing 94". NCBI. Retrieved May 11, 2013.
  16. "UniProt CCDC94". UniProt. Retrieved May 11, 2013.
  17. "PSORT Prediction". PSORT. Retrieved May 11, 2013.
  18. "MINT Protein Interactions". MINT.
  19. "Relevant datasets in Homo sapiens". STRING. Retrieved May 11, 2013.
  20. "ElDorado:Genome and Annotation". Genomatix. Archived from the original on May 22, 2021. Retrieved May 11, 2013.
  21. "NetPhos 2.0 server". ExPASy. Retrieved May 12, 2013.
  22. "The Sulfinator". ExPASy. Retrieved May 12, 2013.
  23. "SUMOplot Analysis Program". ABGENT. Retrieved May 12, 2013.
  24. "CCDC94 Tertiary Structure Prediction". Retrieved May 11, 2013.
  25. "BLAST". NCBI. Retrieved May 12, 2013.
  26. "Protein Analysis Tools". Biology WorkBench. Retrieved May 12, 2013.
  27. "Time Tree".
  28. "coiled-coil domain-containing 94". GeneCards. Retrieved May 11, 2013.
  29. "Coiled-coil domain-containing 130 Homo sapiens". NCBI. Retrieved May 11, 2013.
