Example input for the program


A sequence (e.g. CXB1_RAT) can be submitted in SwissProt format as shown below:

ID   CXB1_RAT       STANDARD;      PRT;   283 AA.
AC   P08033;
DT   01-AUG-1988 (Rel. 08, Created)
DT   01-AUG-1988 (Rel. 08, Last sequence update)
DT   28-FEB-2003 (Rel. 41, Last annotation update)
DE   Gap junction beta-1 protein (Connexin 32) (Cx32) (GAP junction 28 kDa
DE   liver protein).
GN   GJB1 OR CXN-32.
OS   Rattus norvegicus (Rat).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Rattus.
OX   NCBI_TaxID=10116;
RN   [1]
RP   SEQUENCE FROM N.A.
RC   TISSUE=Liver;
RX   MEDLINE=86251013; PubMed=3013898;
RA   Paul D.L.;
RT   "Molecular cloning of cDNA for rat liver gap junction protein.";
RL   J. Cell Biol. 103:123-134(1986).
RN   [2]
RP   SEQUENCE OF 7-119 FROM N.A.
RC   TISSUE=Liver;
RX   MEDLINE=86301142; PubMed=3017758;
RA   Heynkes R., Kozjek G., Traub O., Willecke K.;
RT   "Identification of a rat liver cDNA and mRNA coding for the 28 kDa
RT   gap junction protein.";
RL   FEBS Lett. 205:56-60(1986).
CC   -!- FUNCTION: ONE GAP JUNCTION CONSISTS OF A CLUSTER OF CLOSELY PACKED
CC       PAIRS OF TRANSMEMBRANE CHANNELS, THE CONNEXONS, THROUGH WHICH
CC       MATERIALS OF LOW MW DIFFUSE FROM ONE CELL TO A NEIGHBORING CELL.
CC   -!- SUBUNIT: A CONNEXON IS COMPOSED OF A HEXAMER OF CONNEXINS.
CC   -!- SUBCELLULAR LOCATION: INTEGRAL MEMBRANE PROTEIN.
CC   -!- SIMILARITY: BELONGS TO THE CONNEXIN FAMILY. BETA-TYPE (GROUP I)
CC       SUBFAMILY.
DR   EMBL; L36875; AAA75195.1; -.
DR   PIR; A26278; GJRT.
DR   InterPro; IPR000500; Connexin.
DR   Pfam; PF00029; connexin; 1.
DR   PRINTS; PR00206; CONNEXIN.
DR   SMART; SM00037; CNX; 1.
DR   PROSITE; PS00407; CONNEXINS_1; 1.
DR   PROSITE; PS00408; CONNEXINS_2; 1.
KW   Gap junction; Transmembrane.
FT   DOMAIN        1     22       CYTOPLASMIC (POTENTIAL).
FT   TRANSMEM     23     45       POTENTIAL.
FT   DOMAIN       46     75       EXTRACELLULAR (POTENTIAL).
FT   TRANSMEM     76     95       POTENTIAL.
FT   DOMAIN       96    130       CYTOPLASMIC (POTENTIAL).
FT   TRANSMEM    131    153       POTENTIAL.
FT   DOMAIN      154    191       EXTRACELLULAR (POTENTIAL).
FT   TRANSMEM    192    214       POTENTIAL.
FT   DOMAIN      215    283       CYTOPLASMIC (POTENTIAL).
SQ   SEQUENCE   283 AA;  32003 MW;  C79FC46AA13BC5D7 CRC64;
     MNWTGLYTLL SGVNRHSTAI GRVWLSVIFI FRIMVLVVAA ESVWGDEKSS FICNTLQPGC
     NSVCYDHFFP ISHVRLWSLQ LILVSTPALL VAMHVAHQQH IEKKMLRLEG HGDPLHLEEV
     KRHKVHISGT LWWTYVISVV FRLLFEAVFM YVFYLLYPGY AMVRLVKCEA FPCPNTVDCF
     VSRPTEKTVF TVFMLAASGI CIILNVAEVV YLIIRACARR AQRRSNPPSR KGSGFGHRLS
     PEYKQNEINK LLSEQDGSLK DILRRSPGTG AGLAEKSDRC SAC
//



The same sequence can also be submitted in FASTA format, as shown below:

>CXB1_RAT
MNWTGLYTLLSGVNRHSTAIGRVWLSVIFIFRIMVLVVAAESVWGDEKSSFICNTLQPGC
NSVCYDHFFPISHVRLWSLQLILVSTPALLVAMHVAHQQHIEKKMLRLEGHGDPLHLEEV
KRHKVHISGTLWWTYVISVVFRLLFEAVFMYVFYLLYPGYAMVRLVKCEAFPCPNTVDCF
VSRPTEKTVFTVFMLAASGICIILNVAEVVYLIIRACARRAQRRSNPPSRKGSGFGHRLS
PEYKQNEINKLLSEQDGSLKDILRRSPGTGAGLAEKSDRCSAC
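
The two examples above carry the same 283-residue sequence; the FASTA form keeps only the entry name from the ID line and the residues that follow the SQ line, with the spaces between the 10-residue blocks removed. As a rough illustration of that relationship, the Python sketch below converts one SwissProt-style record into FASTA text. The function name swissprot_to_fasta and its width parameter are hypothetical and are not part of the program; the sketch assumes a single record terminated by "//" and ignores every annotation line.

```python
def swissprot_to_fasta(record: str, width: int = 60) -> str:
    """Convert one SwissProt-style flat-file record to FASTA text.

    Hypothetical helper for illustration only: it reads the entry name
    from the ID line and the residues between the SQ line and '//'.
    """
    entry_name = None
    residues = []
    in_sequence = False

    for line in record.splitlines():
        if line.startswith("ID"):
            # Second whitespace-separated field is the entry name, e.g. CXB1_RAT.
            entry_name = line.split()[1]
        elif line.startswith("SQ"):
            # Sequence data starts on the line after the SQ header.
            in_sequence = True
        elif line.startswith("//"):
            # End-of-record marker.
            break
        elif in_sequence:
            # Drop the indentation and the spaces between 10-residue blocks.
            residues.append("".join(line.split()))

    if entry_name is None:
        raise ValueError("no ID line found in record")

    sequence = "".join(residues)
    # Re-wrap the sequence at a fixed line width (60 in the example above).
    chunks = [sequence[i:i + width] for i in range(0, len(sequence), width)]
    return "\n".join([">" + entry_name] + chunks)
```

Passing the SwissProt record shown earlier as a single string should yield a header line followed by the wrapped sequence, in the same layout as the FASTA example.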