← All Tools

UniProt

VERIFIED

## Connections

protein-database sequence conservation

What It Does

  • Protein sequences (reviewed SwissProt + unreviewed TrEMBL)
  • Functional annotations: domains, sites, modifications
  • Cross-references to 200+ databases (PDB, Pfam, InterPro, GO)
  • Ortholog clusters for conservation analysis
  • Variant annotations

How to Use

### Web

1. Go to https://www.uniprot.org

2. Search: Q7RTU9 (STRC human) or gene name "STRC"

3. Browse: Function, Structure, Disease, Cross-references tabs

### REST API


        # Get protein entry
        curl "https://rest.uniprot.org/uniprotkb/Q7RTU9.json"
        
        # Get FASTA sequence
        curl "https://rest.uniprot.org/uniprotkb/Q7RTU9.fasta"
        
        # Search orthologs
        curl "https://rest.uniprot.org/uniprotkb/search?query=gene:STRC+AND+reviewed:true&format=json"
        

### Python


        import requests
        r = requests.get("https://rest.uniprot.org/uniprotkb/Q7RTU9.json")
        data = r.json()
        print(f"Protein: {data['proteinDescription']['recommendedName']['fullName']['value']}")
        print(f"Length: {data['sequence']['length']} aa")
        # Stereocilin, 1809 aa
        

Verified Status

VERIFIED — STRC = Q7RTU9, Stereocilin, 1775 amino acids (mature chain 23-1775 after signal peptide cleavage). E1659 falls within the main chain with no specific domain/site annotation at that position — it's in the core scaffolding region. This means E1659 is not in a known binding site or catalytic domain per UniProt, but the high conservation (100% across mammals) and AlphaMissense score (0.9016) indicate functional importance not yet annotated. 9 ortholog sequences retrieved for conservation analysis.

STRC Research Usage

  • STRC Gene — canonical sequence reference (Q7RTU9)
  • STRC Cross-Species Conservation Analysis — ortholog sequences from UniProt
  • STRC Signal Peptide Validation — signal peptide annotation
  • STRC GPI-Anchor Analysis — GPI anchor prediction from annotations

Results (April 2026)

  • Domain architecture: mature chain residues 23-1775 (after signal peptide). No specific domain annotation at E1659 — it's in the main scaffolding body.
  • 14 N-linked glycosylation sites across the protein
  • Natural variant rs2920791 at position 1640 (near E1659) — context for the region
  • Still untapped: IntAct interactome, detailed PTM mapping around E1659

Results (April 2026)

  • Domain architecture: mature chain residues 23-1775 (after signal peptide). No specific domain annotation at E1659 — it's in the main scaffolding body.
  • 14 N-linked glycosylation sites across the protein
  • Natural variant rs2920791 at position 1640 (near E1659) — context for the region
  • Still untapped: IntAct interactome, detailed PTM mapping around E1659