DEFINITION

DEFINITION - A concise description of the sequence. Mandatory keyword/one or more records.
  • DEFINITION format
  • The DEFINITION record gives a brief description of the sequence, proceeding from general to specific. It starts with the common name of the source organism, then gives the criteria by which this sequence is distinguished from the remainder of the source genome, such as the gene name and what it codes for, or the protein name and mRNA, or some description of the sequence's function (if the sequence is non-coding). If the sequence has a coding region, the description may be followed by a completeness qualifier, such as cds (complete coding sequence). There is no limit on the number of lines that may be part of the DEFINITION. The last line must end with a period.
  • DEFINITION Format for NLM Entries
  • The DEFINITION line for entries derived from journal-scanning at the NLM is an automatically generated descriptive summary that accompanies each DNA and protein sequence. It contains information derived from fields in a database that summarize the most important attributes of the sequence. The DEFINITION lines are designed to supplement the accession number and the sequence itself as a means of uniquely and completely specifying DNA and protein sequences. The following are examples of NLM DEFINITION lines: NADP-specific isocitrate dehydrogenase [swine, mRNA, 1 gene, 1585 nt] 94 kda fiber cell beaded-filament structural protein [rats, lens, mRNA Partial, 1 gene, 1873 nt] inhibin alpha {promoter and exons} [mice, Genomic, 1 gene, 1102 nt, segment 1 of 2] cefEF, cefG=acetyl coenzyme A:deacetylcephalosporin C o-acetyltransferase [Acremonium chrysogenum, Genomic, 2 genes, 2639 nt] myogenic factor 3, qmf3=helix-loop-helix protein [Japanese quails, embryo, Peptide Partial, 246 aa] The first part of the definition line contains information describing the genes and proteins represented by the molecular sequences. This can be gene locus names, protein names and descriptions that replace or augment actual names. Gene and gene product are linked by "=". Any special identifying terms are presented within brackets, such as: {promoter}, {N-terminal}, {EC 2.13.2.4}, {alternatively spliced}, or {3' region}. The second part of the definition line is delimited by square brackets, '[]', and provides details about the molecule type and length. The biological source, i.e., genus and species or common name as cited by the author. Developmental stage, tissue type and strain are included if available. The molecule types include: Genomic, mRNA, Peptide. and Other Genomic Material. Genomic molecules are assumed to be partial sequence unless "Complete" is specified, whereas mRNA and peptide molecules are assumed to be complete unless "Partial" is noted.
    -----------------------------------------------------------------------