A Uniform Resource Identifier (URI) is a compact string of characters
for identifying an abstract or physical resource. See IETF RFC2396 at
Uniform Resource Identifiers (formerly UniformResourceLocator
s, or URLs--this term is still commonly used) are the lifeblood of the WorldWideWeb
URL and URI are different concepts. A URL gives you information on how you can access a resource, whereas a URI merely identifies a resource and does not in general give you any information on how you might access it (or even imply that you can access it in any meaningful way). For example,
is the URI for the XmlNameSpace for some exchange format to do with MicrosoftExcel and this URI is not an URL.
Many have the form:
is one of http https file mailto news telnet gopher ftp plus some more obsolete protocols.
is an IP address in decimal or decimal dotted form, or the name of a host on that can be resolved to an IP address.
is the internet port to connect to. If absent, the default port for the given protocol
is used (e.g., port 80 for http). The hostname and port combo must end with a /.
is somewhat dependent on the protocol, and sometimes is not present. It's typically hierarchical. The last part of the path is often used to indicate the type of data and what the client might do to it. For example ".txt" indicates normal preformatted ASCII text and ".html" HTML.
is the non-hierarchical segment, typically used as command arguments.
See also http://usemod.com/cgi-bin/mb.pl?UniformResourceLocator
Actually, it's not that simple, and people shouldn't be writing URI parsers
if under the impression that it is. See
for details --DanBarlow
Specifically, a URL has location information contained in it, but URLs are only one kind of URI. This is an instance of OnlyKnowingBestKnownCase. -- RobertField
Expect this page to have a high lapse rate as the details get straightened out. URIs often only partially understood.
CategoryAcronym CategorySemanticWeb CategoryDiscovery