Skip to main content

Tags for the Identification of Languages
RFC 3066

Document Type RFC - Best Current Practice (January 2001) Errata
Obsoleted by RFC 4646, RFC 4647
Obsoletes RFC 1766
Was draft-alvestrand-lang-tag-v2 (individual)
Author Harald T. Alvestrand
Last updated 2020-01-21
RFC stream Legacy stream
Formats
IESG Responsible AD (None)
Send notices to (None)
RFC 3066
<new code> instead" to the
   "other relevant information" section.

   Note: The purpose of the "published description" is intended as an
   aid to people trying to verify whether a language is registered, or
   what language a particular tag refers to.  In most cases, reference
   to an authoritative grammar or dictionary of the language will be
   useful; in cases where no such work exists, other well known works
   describing that language or in that language may be appropriate.  The
   language tag reviewer decides what constitutes a "good enough"
   reference material.

4. Security Considerations

   The only security issue that has been raised with language tags since
   the publication of RFC 1766, which stated that "Security issues are
   believed to be irrelevant to this memo", is a concern with language
   ranges used in content negotiation - that they may be used to infer
   the nationality of the sender, and thus identify potential targets
   for surveillance.

   This is a special case of the general problem that anything you send
   is visible to the receiving party; it is useful to be aware that such
   concerns can exist in some cases.

   The evaluation of the exact magnitude of the threat, and any possible
   countermeasures, is left to each application protocol.

5. Character set considerations

   Language tags may always be presented using the characters A-Z, a-z,
   0-9 and HYPHEN-MINUS, which are present in most character sets, so
   presentation of language tags should not have any character set
   issues.

Alvestrand               Best Current Practice                  [Page 9]
RFC 3066          Tags for Identification of Languages      January 2001

   The issue of deciding upon the rendering of a character set based on
   the language tag is not addressed in this memo; however, it is
   thought impossible to make such a decision correctly for all cases
   unless means of switching language in the middle of a text are
   defined (for example, a rendering engine that decides font based on
   Japanese or Chinese language may produce suboptimal output when a
   mixed Japanese-Chinese text is encountered)

6. Acknowledgements

   This document has benefited from many rounds of review and comments
   in various fora of the IETF and the Internet working groups.

   Any list of contributors is bound to be incomplete; please regard the
   following as only a selection from the group of people who have
   contributed to make this document what it is today.

   In alphabetical order:

   Glenn Adams, Tim Berners-Lee, Marc Blanchet, Nathaniel Borenstein,
   Eric Brunner, Sean M. Burke, John Clews, Jim Conklin, Peter
   Constable, John Cowan, Mark Crispin, Dave Crocker, Mark Davis, Martin
   Duerst, Michael Everson, Ned Freed, Tim Goodwin, Dirk-Willem van
   Gulik, Marion Gunn, Paul Hoffman, Olle Jarnefors, Kent Karlsson, John
   Klensin, Alain LaBonte, Chris Newman, Keith Moore, Masataka Ohta,
   Keld Jorn Simonsen, Otto Stolz, Rhys Weatherley, Misha Wolf, Francois
   Yergeau and many, many others.

   Special thanks must go to Michael Everson, who has served as language
   tag reviewer for almost the complete period since the publication of
   RFC 1766, and has provided a great deal of input to this revision.

7. Author's Address

   Harald Tveit Alvestrand
   Cisco Systems
   Weidemanns vei 27
   7043 Trondheim
   NORWAY

   Phone: +47 73 50 33 52
   EMail: Harald@Alvestrand.no

Alvestrand               Best Current Practice                 [Page 10]
RFC 3066          Tags for Identification of Languages      January 2001

8. References

   [ISO 639]   ISO 639:1988 (E/F) - Code for the representation of names
               of languages - The International Organization for
               Standardization, 1st edition, 1988-04-01 Prepared by
               ISO/TC 37 - Terminology (principles and coordination).
               Note that a new version (ISO 639-1:2000) is in
               preparation at the time of this writing.

   [ISO 639-2] ISO 639-2:1998 - Codes for the representation of names of
               languages -- Part 2: Alpha-3 code  - edition 1, 1998-11-
               01, 66 pages, prepared by a Joint Working Group of ISO
               TC46/SC4 and ISO TC37/SC2.

   [ISO 3166]  ISO 3166:1988 (E/F) - Codes for the representation of
               names of countries - The International Organization for
               Standardization, 3rd edition, 1988-08-15.

   [RFC 1327]  Kille, S., "Mapping between X.400 (1988) / ISO 10021 and
               RFC 822", RFC 1327, May 1992.

   [RFC 1521]  Borenstein, N., and N. Freed, "MIME Part One: Mechanisms
               for Specifying and Describing the Format of Internet
               Message Bodies", RFC 1521, September 1993.

   [RFC 2026]  Bradner, S., "The Internet Standards Process -- Revision
               3", BCP 9, RFC 2026, October 1996.

   [RFC 2028]  Hovey, R. and S. Bradner, "The Organizations Involved in
               the IETF Standards Process", BCP 11, RFC 2028, October
               1996.

   [RFC 2119]  Bradner, S."Key words for use in RFCs to Indicate
               Requirement Levels", BCP 14, RFC 2119, March 1997.

   [RFC 2234]  Crocker, D. and P. Overell, "Augmented BNF for Syntax
               Specifications: ABNF", RFC 2234, November 1997.

   [RFC 2616]  Fielding, R., Gettys, J., Mogul, J., Frystyk, H.,
               Masinter, L., Leach, P. and T. Berners-Lee, "Hypertext
               Transfer Protocol -- HTTP/1.1", RFC 2616, June 1999.

   [RFC 2860]  Carpenter, B., Baker, F. and M. Roberts, "Memorandum of
               Understanding Concerning the Technical Work of the
               Internet Assigned Numbers Authority", RFC 2860, June
               2000.

Alvestrand               Best Current Practice                 [Page 11]
RFC 3066          Tags for Identification of Languages      January 2001

Appendix A: Language Tag Reference Material

   The Library of Congress, maintainers of ISO 639-2, has made the list
   of languages registered available on the Internet.

   At the time of this writing, it can be found at
   http://www.loc.gov/standards/iso639-2/langhome.html

   The IANA registration forms for registered language codes can be
   found at http://www.iana.org/numbers.html under "languages".

   The ISO 3166 Maintenance Agency has published Web pages at

   http://www.din.de/gremien/nas/nabd/iso3166ma/

Appendix B: Changes from RFC 1766

   - Email list address changed from ietf-types@uninett.no to ietf-
     languages@iana.org

   - Updated author's address

   - Added language-range construct from HTTP/1.1

   - Added use of ISO 639-2 language codes

   - Added reference to Library of Congress lists of language codes

   - Changed examples to use registered tags

   - Added "Any other information" to registration form

   - Added description of procedure for updating registrations

   - Changed target category for document from standards track to BCP

   - Moved the content-language header definition into another document

   - Added numbers to the permitted characters in language tags

Alvestrand               Best Current Practice                 [Page 12]
RFC 3066          Tags for Identification of Languages      January 2001

Full Copyright Statement

   Copyright (C) The Internet Society (2001).  All Rights Reserved.

   This document and translations of it may be copied and furnished to
   others, and derivative works that comment on or otherwise explain it
   or assist in its implementation may be prepared, copied, published
   and distributed, in whole or in part, without restriction of any
   kind, provided that the above copyright notice and this paragraph are
   included on all such copies and derivative works.  However, this
   document itself may not be modified in any way, such as by removing
   the copyright notice or references to the Internet Society or other
   Internet organizations, except as needed for the purpose of
   developing Internet standards in which case the procedures for
   copyrights defined in the Internet Standards process must be
   followed, or as required to translate it into languages other than
   English.

   The limited permissions granted above are perpetual and will not be
   revoked by the Internet Society or its successors or assigns.

   This document and the information contained herein is provided on an
   "AS IS" basis and THE INTERNET SOCIETY AND THE INTERNET ENGINEERING
   TASK FORCE DISCLAIMS ALL WARRANTIES, EXPRESS OR IMPLIED, INCLUDING
   BUT NOT LIMITED TO ANY WARRANTY THAT THE USE OF THE INFORMATION
   HEREIN WILL NOT INFRINGE ANY RIGHTS OR ANY IMPLIED WARRANTIES OF
   MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE.

Acknowledgement

   Funding for the RFC Editor function is currently provided by the
   Internet Society.

Alvestrand               Best Current Practice                 [Page 13]