Hypertext Markup Language: A Representation of Textual Information and Metainformation For Retrieval and Interchange
Hypertext Markup Language: A Representation of Textual Information and Metainformation For Retrieval and Interchange
Abstract
HyperText Markup Language (HTML) can be used to represent
The World Wide Web (W3) initiative links related information throughout the globe. HTML provides one simple
format for providing linked information, and all W3 compatible programs are required to be capable of handling
HTML. W3 uses an Internet protocol (Hypertext Transfer Protocol, HTTP), which allows transfer representations
to be negotiated between client and server, the result being returned in an extended MIME message. HTML is
therefore just one, but an important one, of the representations used with W3.
HTML is proposed as a MIME content type.
HTML refers to the URL specification of RFCxxxx.
Implementations of HTML parsers and generators can be found in the various W3 servers and browsers, in the
public domain W3 code, and may also be built using various public domain SGML parsers such as [SGMLS].
HTML is an SGML document type with fairly generic semantics appropriate for representing information from
a wide range of applications. It is more generic than many specific SGML applications, but is still completely
device-independent.
1
RFC XXX Hypertext Markup language June 1993
1. In this document
This document contains the following parts:
Vocabulary used in this document, degrees of imperative.
HTML and MIME with discussion of character sets.
HTML and SGML and the relationship between them, and Structured text : an introduction for beginners to
SGML.
HTML Elements A list with description, example, and typical rendering.
HTML Entities Entities used to describe characters.
The HTML DTD The text of the SGML DTD for HTML
Link relationship values . A provisional list. Not part of the standard.
Registration Authority The authority for extending lists of valid vales.
References to related documents
Authors addresses Contact information.
table of contents
1.1 Vocabulary
This specification uses the words below with the precise meaning given.
Representation The encoding of information for interchange. For example, HTML is a representation of
hypertext.
Rendering The form of presentation to information to the human reader.
1.1.1 Imperatives
may The implementation is not obliged to follow this in any way.
must If this is not followed, the implementation does not conform to this specification.
shall as "must"
should If this is not followed, though the implementation officially conforms to the standard,
undesirable results may occur in practice.
typical Typical rendering is described for many elements. This is not a mandatory part of the
standard but is given as guidance for designers and to help explain the uses for which the
elements were intended.
1.1.2 Notes
Sections marked "Note:" are not mandatory parts of the specification but for guidance only.
3. Introduction
The HyperText Markup Language is defined in terms of the ISO Standard Generalized Markup Language []. SGML
is a system for defining structured document types and markup languages to represent instances of those document
types.
Every SGML document has three parts:
An SGML declaration, which binds SGML processing quantities and syntax token names to specific values.
For example, the SGML declaration in the HTML DTD specifies that the string that opens a tag is </ and
the maximum length of a name is 40 characters.
A prologue including one or more document type declarations, which specifiy the element types, element
relationships and attributes, and references that can be represented by markup. The HTML DTD specifies,
for example, that the HEAD element contains at most one TITLE element.
An instance, which contains the data and markup of the document.
We use the term HTML to mean both the document type and the markup language for representing instances of
that document type.
All HTML documents share the same SGML declaration an prologue. Hence implementations of the WorldWide
Web generally only transmit and store the instance part of an HTML document. To construct an SGML document
entity for processing by an SGML parser, it is necessary to prefix the text from “HTML DTD” on page 10 to the
HTML instance.
Conversely, to implement an HTML parser, one need only implement those parts of an SGML parser that are
needed to parse an instance after parsing the HTML DTD.
Some elements (e.g. P, LI) are empty. They have no content. They show up as just a start tag.
For the rest of the elements, the content is a sequence of data characters and nested elements. Note that the HTML
DTD in fact severely limits the amount of nesting which is allowed: most things cannot be nested, in fact. No
elements may be recursively nested. Anchors and character highlighting may be put inside other constructs.
3.1.1 Tags
Every element starts with a tag, and every non-empty element ends with a tag. Start tags are delimited by < and
>, and end tags are delimited by </ and >.
Names
The element name immediately follows the tag open delimiter. Names consist of a letter followed by up to 33
letters, digits, periods, or hyphens. Names are not case sensitive.
Attributes
In a start tag, whitespace and attributes are allowed between the element name and the closing delimiter. An
attribute consists of a name, an equal sign, and a value. Whitespace is allowed around the equal sign.
The value is specified in a string surrounded by single quotes or a string surrounded by double quotes. (See: other
tolerated forms @@)
The string is parsed like RCDATA (see below ) to determine the attribute value. This allows, for example, quote
characters in attribute values to be represented by character references.
The length of an attribute value (after parsing) is limited to 1024 characters.
Empty Elements
Empty elements have the keyword EMPTY in their declaration. For example:
<!ELEMENT NEXTID - O EMPTY>
<!ATTLIST NEXTID N NUMBER #REQUIRED>
Character Data
The keyword CDATA indicates that the content of an element is character data. Character data is all the text up to
the next end tag open delimiter-in-context. For example:
The string </ is only recognized as the opening delimiter of an end tag when it is “in context,” that is, when it
is followed by a letter. However, as soon as the end tag open delimiter is recognized, it terminates the CDATA
content. The following is an error:
Elements with RCDATA content behave much like those with CDATA, except for character references and entity
references. Elements declared like:
Character References To represent a character that would otherwise be recognized as markup, use a character
reference. The string &# signals a character reference when it is followed by a letter or a digit. The delimiter is
followed by the decimal character number and a semicolon. For example:
Entity References The HTML DTD declares entities for the less than, greater than, and ampersand characters
and each of the ISO Latin 1 characters so that you can reference them by name rather than by number.
The string & signals an entity reference when it is followed by a letter or a digit. The delimiter is followed by the
entity name and a semicolon. For example:
Note: To be sure that a string of characters has no markup, HTML writers should represent all
occurrences of <, >, and & by character or entity references.
Element Content
Some elements have, in stead of a keyword that states the type of content, a content model, which tells what patterns
of data and nested elements are allowed. If the content model of an element does not include the symbol #PCDATA
, the content is element content.
Whitespace in element content is considered markup and ignored. Any characters that are not markup, that is, data
characters, are illegal.
For example:
<!ELEMENT HEAD - - (TITLE? & ISINDEX? & NEXTID? & LINK*)>
Mixed Content
If the content model includes the symbol #PCDATA, the content of the element is parsed as mixed content. For
example:
<!ELEMENT PRE - - (#PCDATA | A | B | I | U | P)+>
<!ATTLIST PRE
WIDTH NUMBER #implied
>
This says that the PRE element contains one or more A, B, I, U, or P elements or data characters. Here’s an example
of a PRE element:
<pre>
<b>NAME</b>
cat -- concatenate<a href=’’terms.html#file’’>files</a>
<b>EXAMPLE</b>
cat <xyz
</pre>
A B element
The string “ cat concatenate”
An A element
The string “\n”
Another B element
The string “\n cat <xyz”
<HEAD>
<TITLE>HTML Guide: Recommended Usage</TITLE>
<!-- $Id: recommended.html,v 1.3 93/01/06 18:38:11 connolly Exp $ -->
</HEAD>
There are a few other SGML markup constructs that are deprecated or illegal.
Delimiter Signals...
<? Processing instruction. Terminated by >.
<![ Marked section. Marked sections are deprecated. See the SGML standard for complete
information.
<! Markup declaration. HTML defines no short reference maps, so these are errors. Termi-
nated by >.
or
<pre>
some example text
</pre>
<pre>
<!-- this line is ignored, including the linebreak character -->
first line
first line
third line
fourth line.
4. HTML Elements
This is a list of elements used in the HTML language. Documents should (but need not absolutely) contain an
initial HEAD element followed by a BODY element.
Old style documents may contain a just the contents of the normal HEAD and BODY elements, in any order. This
is deprecated but must be supported by parsers.
See also: Status of elements
4.3 Graphics
IMG The IMG tag allows inline graphics.
4.5 HEAD
The HEAD element contains all information about the document in general. It does not contain any text which is
part of the document: this is in the BODY. Within the head element, only certain elements are allowed.
4.6 BODY
The BODY element contains all the information which is part of the document, as opposed information about the
document which is in the HEAD .
The elements within the BODY element are in the order in which they should be presented to the reader.
See the list of things which are allowed within a BODY element .
4.7 Anchors
An anchor is a piece of text which marks the beginning and/or the end of a hypertext link.
The text between the opening tag and the closing tag is either the start or destination (or both) of a link. Attributes
of the anchor tag are as follows.
HREF OPTIONAL. If the HREF attribute is present, the anchor is sensitive text: the start of
a link. If the reader selects this text, (s)he should be presented with another document
whose network address is defined by the value of the HREF attribute . The format of the
network address is specified elsewhere . This allows for the form HREF="#identifier" to
refer to another anchor in the same document. If the anchor is in another document, the
attribute is a relative name , relative to the documents address (or specified base address
if any).
NAME OPTIONAL. If present, the attribute NAME allows the anchor to be the destination of
a link. The value of the attribute is an identifier for the anchor. Identifiers are arbitrary
strings but must be unique within the HTML document. Another document can then make
a reference explicitly to this anchor by putting the identifier after the address, separated
by a hash sign .
REL OPTIONAL. An attribute REL may give the relationship (s) described by the hypertext
link. The value is a comma-separated list of relationship values. Values and their
semantics will be registered by the HTML registration authority . The default relationship
if none other is given is void. REL should not be present unless HREF is present. See
Relationship values , REV .
REV OPTIONAL. The same as REL , but the semantics of the link type are in the reverse
direction. A link from A to B with REL="X" expresses the same relationship as a link
from B to A with REV="X". An anchor may have both REL and REV attributes.
URN OPTIONAL. If present, this specifies a uniform resource number for the document. See
note .
TITLE OPTIONAL. This is informational only. If present the value of this field should equal
the value of the TITLE of the document whose address is given by the HREF attribute.
See note .
METHODS OPTIONAL. The value of this field is a string which if present must be a comma separated
list of HTTP METHODS supported by the object for public use. See note .
All attributes are optional, although one of NAME and HREF is necessary for the anchor to be useful. See also:
LINK .
The format of URNs is under discussion (1993) by various working groups of the Internet Engineering Task Force.
4.8 Address
This element is for address information, signatures, authorship, etc, often at the top or bottom of a document.
<ADDRESS>
Newsletter editor<p>
J.R. Brown<p>
JimquickPost News, Jumquick, CT 01234<p>
Tel (123) 456 7890
</ADDRESS>
4.9 BASE
This element allows the URL of the document itself to be recorded in situations in which the document may be
read out of context. URLs within the document may be in a "partial" form relative to this base address.
Where the base address is not specified, the reader will use the URL it used to access the document to resolve any
relative URLs.
The one attribute is:
HREF the URL
4.10 BLOCKQUOTE
The BLOCKQUOTE element allows text quoted from another source to be rendered specially.
4.10.2 Example
I think it ends
<BLOCKQUOTE>Soft you now, the fair Ophelia. Nymph, in thy orisons,
be all my sins remembered.
</BLOCKQUOTE>
but I am not sure.
4.11 Headings
Six levels of heading are supported. (Note that a hypertext node within a hypertext work tends to need less levels
of heading than a work whose only structure is given by the nesting of headings.)
A heading element implies all the font changes, paragraph breaks before and after, and white space (for example)
necessary to render the heading. Further character emphasis or paragraph marks are not required in HTML.
H1 is the highest level of heading, and is recommended for the start of a hypertext node. It is suggested that the the
text of the first heading be suitable for a reader who is already browsing in related information, in contrast to the
title tag which should identify the node in a wider context.
The heading elements are
It is not normal practice to jump from one header to a header level more than one below, for example for follow
an H1 with an H3. Although this is legal, it is discouraged, as it may produce strange results for example when
generating other representations from the HTML.
4.11.1 Example:
<H1>This is a heading</H1>
Here is some text
<H2>Second level heading</H2>
Here is some more text.
4.12.1 Example
Warning: < IMG SRC ="triangle.gif" ALT="Warning:"> This must be done by a
qualified technician.
4.13 ISINDEX
This element informs the reader that the document is an index document. As well as reading it, the reader may use
a keyword search.
The node may be queried with a keyword search by suffixing the node address with a question mark, followed by
a list of keywords separated by plus signs. See the network address format .
Note that this tag is normally generated automatically by a server. If it is added by hand to an HTML document,
then the client will assume that the server can handle a search on the document. Obviously the server must have
this capability for it to work: simply adding <ISINDEX> in the document is not enough to make searches happen
if the server does not have a search engine!
Status: standard.
4.14 LINK
The LINK element occurs within the HEAD element of an HTML document. It is used to indicate a relationship
between the document and some other object. A document may have any number of LINK elements.
The LINK element is empty, but takes the same attributes as the anchor element .
Typical uses are to indicate authorship, related indexes and glossaries, older or more recent versions, etc. Links
can indicate a static tree structure in which the document was authored by pointing to a "parent" and "next" and
"previous" document, for example.
Servers may also allow links to be added by those who do not have the right to alter the body of a document.
4.15.1 Glossaries
A glossary (or definition list) is a list of paragraphs each of which has a short title alongside it. Apart from
glossaries, this element is useful for presenting a set of named elements to the reader. The elements within a
glossary follow are
DT The "term", typically placed in a wide left indent
DD The "definition", which may wrap onto many lines
These elements must appear in pairs. Single occurrences of DT without a following DD are illegal. The one
attribute which DL can take is
COMPACT suggests that a compact rendering be used, because the enclosed elements are individually
small, or the whole glossary is rather large, or both.
Typical rendering
The definition list DT, DD pairs are arranged vertically. For each pair, the DT element is on the left, in a column
of about a third of the display area, and the DD element is in the right hand two thirds of the display area. The DT
term is normally small enough to fit on one line within the left-hand column. If it is longer, it will either extend
across the page, in which case the DD section is moved down to separate them, or it is wrapped onto successive
lines of the left hand column.
White space is typically left between successive DT,DD pairs unless the COMPACT attribute is given. The
COMPACT attribute is appropriate for lists which are long and/or have DT,DD pairs which each take only a line
or two. It is of course possible for the rendering software to discover these cases itself and make its own decisions,
and this is to be encouraged.
The COMPACT attribute may also reduce the width of the left-hand (DT) column.
Examples of use
<DL>
<DT>Term the first<DD>definition paragraph is reasonably
long but is still displayed clearly
<DT>Term2 follows<DD>Definition of term2
</DL>
<DL COMPACT>
<DT>Term<DD>definition paragraph
<DT>Term2<DD>Definition of term2
</DL>
4.15.2 Lists
A list is a sequence of paragraphs, each of which may be preceded by a special mark or sequence number. The
syntax is:
<UL>
<LI> list element
<LI> another list element ...
</UL>
The opening list tag may be any of UL, OL, MENU or DIR. It must be immediately followed by the first list
element.
Typical rendering
The representation of the list is not defined here, but a bulleted list for unordered lists, and a sequence of numbered
paragraphs for an ordered list would be quite appropriate. Other possibilities for interactive display include
embedded scrollable browse panels.
List elements with typical rendering are:
UL A list of multi-line paragraphs, typically separated by some white space and/or marked
by bullets, etc.
OL As UL, but the paragraphs are typically numbered in some way to indicate the order as
significant.
MENU A list of smaller paragraphs. Typically one line per item, with a style more compact than
UL.
DIR A list of short elements, typically less than 20 characters. These may be arranged in
columns across the page, typically 24 character in width. If the rendering software is able
to optimize the column width as function of the widths of individual elements, so much
the better.
Example of use
<OL>
<LI> When you get to the station, leave
by the southern exit, on platform one.
4.16 Next ID
This tag takes a single attribute which is the number of the next document-wide numeric identifier to be allocated
of the form z123.
When modifying a document, old anchor ids should not be reused, as there may be references stored elsewhere
which point to them. This is read and generated by hypertext editors. Human writers of HTML usually use
mnemonic alphabetical identifiers. Browser software may ignore this tag.
The empty P element indicates a paragraph break. The exact rendering of this (indentation, leading, etc) is not
defined here, and may be a function of other tags, style sheets etc.
<P> is used between two pieces of text which otherwise would be flowed together.
You do NOT need to use <P> to put white space around heading, list, address or blockquote elements which imply
a paragraph break. It is the responsibility of the rendering software to generate that white space. A paragraph mark
which is preceded or followed by such elements which imply a paragraph break is has undefined effect and should
be avoided.
Preformatted elements in HTML are displayed with text in a fixed width font, and so are suitable for text which
has been formatted for a teletype by some existing formatting system.
Line boundaries within the text are rendered as a move to the beginning of the next line, except for one
immediately following or immediately preceding a tag.
The <p> tag should not be used. If found, it should be rendered as a move to the beginning of the next line.
Elements which define paragraph formatting (Headings, Address, etc) must not be used.
The ASCII Horizontal Tab (HT) character must be interpreted as the smallest positive nonzero number
of spaces which will leave the number of characters so far on the line as a multiple of 8. Its use is not
recommended however.
Example of use
<PRE WIDTH="80">
This is an example line
</PRE>
Note: Highlighting
Within a preformatted element, the constraint that the rendering must be on a fixed horizontal character pitch may
limit or prevent the ability of the renderer to render highlighting elements specially.
Note: Margins
The above references to the "beginning of a new line" must not be taken as implying that the renderer is forbidden
from using a (constant) left indent for rendering preformatted text. The left indent may of course be constrained
by the width required.
4.19 TITLE
The title of a document is specified by the TITLE element. The TITLE element should occur in the HEAD of the
document.
There may only be one title in any document. It should identify the content of the document in a fairly wide context.
The title is not part of the text of the document, but is a property of the whole document. It may not contain anchors,
paragraph marks, or highlighting. The title may be used to identify the node in a history list, to label the window
displaying the node, etc. It is not normally displayed in the text of a document itself. Contrast titles with headings
. The title should ideally be less than 64 characters in length. That is, many applications will display document
titles in window titles, menus, etc where there is only limited room. Whilst there is no limit on the length of a title
(as it may be automatically generated from other data), information providers are warned that it may be truncated
if long.
Examples of use
or
or
Examples of inappropriate titles are those which are only meaningful within context,
<TITLE>Introduction</TITLE>
or too long,
Status: Extra
These elements allow sections of text to be formatted in a particular way, to provide emphasis, etc. The tags do
NOT cause a paragraph break, and may be used on sections of text within paragraphs.
Where not supported by implementations, like all tags, these tags should be ignored but the content rendered.
All these tags have related closing tags, as in
Some of these styles are more explicit than others about how they should be physically represented. The logical
styles should be used wherever possible, unless for example it is necessary to refer to the formatting in the text.
(Eg, "The italic parts are mandatory".)
Note:
Browsers unable to display a specified style may render it in some alternative, or the default, style, with some loss
of quality for the reader. Some implementations may ignore these tags altogether, so information providers should
attempt not to rely on them as essential to the information content.
These element names are derived from TeXInfo macro names.
The following elements of HTML are obsolete. It is recommended that client implementors implement the obsolete
forms for compatibility with old servers.
Plaintext
Status: Obsolete .
The empty PLAINTEXT tag terminates the HTML entity. What follows is not SGML. In stead, there’s an old
HTTP convention that what follows is an ASCII (MIME "text/plain") body.
An example if its use is:
<PLAINTEXT>
0001 This is line one of a ling listing
0002 file from <any@host.inc.com> which is sent
This tag allows the rest of a file to be read efficiently without parsing. Its presence is an optimization. There is no
closing tag. The rest of the data is not in SGML.
Status: Obsolete . This are in use and should be recognized by browsers. New servers should use <PRE> instead.
These styles allow text of fixed-width characters to be embedded absolutely as is into the document. The syntax is:
<LISTING>
...
</LISTING>
or
<XMP>
...
</XMP>
The text between these tags is to be portrayed in a fixed width font, so that any formatting done by character spacing
on successive lines will be maintained. Between the opening and closing tags:
The text may contain any ISO Latin printable characters, but not the end tag opener. (See Historical note )
Line boundaries are significant, except any occurring immediately after the opening tag or before the closing
tag. and are to be rendered as a move to the start of a new line.
The ASCII Horizontal Tab (HT) character must be interpreted as the smallest positive nonzero number
of spaces which will leave the number of characters so far on the line as a multiple of 8. Its use is not
recommended however.
The LISTING element is portrayed so that at least 132 characters will fit on a line. The XMP elementis portrayed
in a font so that at least 80 characters will fit on a line but is otherwise identical to LISTING.
Status: Obsolete . These tags like all others should be ignored if not implemented. Replaced will more meaningful
elements see character highlighting .
Examples of use:
<HP1>...</HP1> <HP2>... </HP2> etc.
Comment element
Status: Obsolete
A comment element used for bracketing off unneed text and comment has been introduced in some browsers but
will be replaced by the SGML command feature in new implementations.
5. Entities
The following entity names are used in HTML , always prefixed by ampersand (&) and followed by a semicolon
as shown. They represent particular graphic characters which have special meanings in places in the markup, or
may not be part of the character set available to the writer.
< The less than sign <
> The "greater than" sign >
& The ampersand sign & itself.
" The double quote sign "
Also allowed are references to any of the ISO Latin-1 alphabet, using the entity names in the following table.
CHARSET
BASESET "ISO 646:1983//CHARSET
International Reference Version (IRV)//ESC 2/5 4/0"
DESCSET 0 9 UNUSED
9 2 9
11 2 UNUSED
13 1 13
14 18 UNUSED
32 95 32
127 1 UNUSED
BASESET "ISO Registration Number 100//CHARSET
ECMA-94 Right Part of Latin Alphabet Nr. 1//ESC 2/13 4/1"
DESCSET 128 32 UNUSED
160 95 32
255 1 UNUSED
CAPACITY SGMLREF
TOTALCAP 150000
GRPCAP 150000
SCOPE DOCUMENT
SYNTAX
SHUNCHAR CONTROLS 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18
19 20 21 22 23 24 25 26 27 28 29 30 31 127 255
BASESET "ISO 646:1983//CHARSET
International Reference Version (IRV)//ESC 2/5 4/0"
DESCSET 0 128 0
FUNCTION RE 13
RS 10
SPACE 32
TAB SEPCHAR 9
NAMING LCNMSTRT ""
UCNMSTRT ""
LCNMCHAR ".-"
UCNMCHAR ".-"
NAMECASE GENERAL YES
ENTITY NO
FEATURES
MINIMIZE
DATATAG NO
OMITTAG NO
RANK NO
SHORTTAG NO
LINK
SIMPLE NO
IMPLICIT NO
EXPLICIT NO
OTHER
CONCUR NO
SUBDOC NO
FORMAL YES
APPINFO NONE
>
<!DOCTYPE HTML [
<!-- Jul 1 93 -->
<!-- Regarding clause 6.1, SGML Document:
<!ENTITY % headelement
"TITLE|NEXTID|ISINDEX" >
<!ENTITY % bodyelement
"P | %heading |
%list | DL | HEADERS | ADDRESS | PRE | BLOCKQUOTE
| %literal">
<!ENTITY % linkattributes
"NAME NMTOKEN #IMPLIED
HREF %URL; #IMPLIED
REL CDATA #IMPLIED -- forward relationship type --
REV CDATA #IMPLIED -- reversed relationship type
to referent data:
">
>
<!ENTITY % inline "EM | TT | STRONG | B | I | U |
CODE | SAMP | KBD | KEY | VAR | DFN | CITE "
>
<!ELEMENT A - - (%text)>
<!ATTLIST A
%linkattributes;
>
<!ELEMENT DT - O EMPTY>
<!ELEMENT DD - O EMPTY>
-->
<!ATTLIST (%list)
COMPACT NAME #IMPLIED -- COMPACT, etc.--
>
<!ELEMENT LI - O EMPTY>
7.1.1 UseIndex
B is a related index for a search by a user reading this document who asks for an index search function.
A document may have any number of index links, causing several indexes top be searched in a client-defined
manner.
B must support SEARCH operations under its access protocol.
7.1.2 UseGlossary
B is an index which should be used to resolve glossary queries in the document. (Typically, a double-click on a
word which is not within an anchor).
A document may have any number of glossary links.
7.1.3 Annotation
The information in B is additional to and subsidiary to that in A.
Annotation is used by one person to write the equivalent of "margin notes" or other criticism on another’s document,
for example.
Example: The relationship between a newsgroup and its articles.
Acyclic.
7.1.4 Reply
Similar to Annotation, but there is no suggestion that B is subsidiary to A: A and B are on equal footings.
Example: The relationship between a mail message and its reply, a news article and its reply.
Acyclic.
7.1.5 Embed
If this link is followed, the node at the end of it is embedded into the display of the source document.
Acyclic.
7.1.6 Precedes
In an ordered structure defined by the author, A precedes B, B is followed by A.
Acyclic.
Any document may only have one link of this relationship, and/or one link of the reverse relationship.
Note: May be used to control navigational aids, generate printed material, etc. In conjunction with " subdocument
", may be used to define a tree such as a printed book made of hypertext document. The document can only have
one such tree.
7.1.7 Subdocument
B is a lower part in the author’s hierarchy to A. Acyclic. See also Precedes .
7.1.8 Present
Whenever A is presented, B must also be presented. This implies that whenever A is retrieved, B must also be
retrieved.
7.1.9 Search
When the link is followed, the node B should be searched rather than presented. That is, where the client software
allows it, the user should immediately be presented with a search panel and prompted for text. The search is then
performed without an intermediate retrieval or presentation of the node B
7.1.10 Supersedes
B is a previous version of A.
Acyclic.
7.1.11 History
B is a list of versions of A
A link reverse link must exist from B to A and to all other known versions of A.
These relationships convey semantics about objects described by documents, rather than the documents themselves.
7.2.1 Includes
A includes B, B is part of A. For example, a person described by document A is a part of the group described by
document B.
Acyclic.
7.2.2 Made
Person (etc) described by node A is author of, or is responsible for B
This information can be used for protection, and informing authors of interest, for sending mail to authors, etc.
7.2.3 Interested
Person (etc) described by A is interested in node B.
This information can be used for notification of changes.
Typically, this is a request that, when object B changes in some way, a new link is made to object A.
The phrase "object B changes" may be interpreted narrowly (as "B itself changes") or widely (as "B or anythink
linked to it or related to it closely changes"). The amount of change considered worth notifying people about is
also subject to interpretation, varying from bit changes in the source to a "new edition" statement by the publisher.
8. Registration Authority
The HTTP Registration Authority is responsible for maintaining lists of:
It is proposed that the Internet Assigned Numbers Authority or their successors take this role.
Unregistered values may be used for experimental purposes if they are start with "X-".
9. References
SGML ISO 8879:1986, Information Processing Text and Office Systems Standard Generalized
Markup Language (SGML).
sgmls an SGML parser by James Clark <jjc@jclark.com> derived from the ARCSGML parser
materials which were written by Charles F. Goldfarb. The source is available on the
ifi.uio.no FTP server in the directory /pub/SGML/SGMLS .
WWW The World-Wide Web , a global information initiative. For bootstrap information, telnet
info.cern.ch or find documents by ftp://info.cern.ch/pub/www/doc
URL Universal Resource Locators. RFCxxx. Currently available by anonymous FTP from
info.cern.ch in /pub/ietf.
Daniel Connolly
Address: Atrium Technologies, Inc.
5000 Plaza on the Lake, Suite 275
Austin, TX 78746
USA
email: connolly@atrium.com