Javascript disabled? Like other modern websites, the IETF Datatracker relies on Javascript. Please enable Javascript for full functionality.

Character Normalization in IETF Protocols
draft-duerst-i18n-norm-04

Versions:

Document	Type	Expired Internet-Draft (individual) Expired & archived
	Authors	Martin J. Dürst , Mark Davis
	Last updated	2000-09-13
	RFC stream	(None)
	Intended RFC status	(None)
	Formats	txt htmlized pdf bibtex bibxml
Stream	Stream state	(No stream defined)
	Consensus boilerplate	Unknown
	RFC Editor Note	(None)
IESG	IESG state	Expired
	Telechat date	(None)
	Responsible AD	(None)
	Send notices to	(None)

Email authors IPR References Referenced by Nits Search email archive

This Internet-Draft is no longer active. A copy of the expired Internet-Draft is available in these formats:

txt htmlized pdf bibtex bibxml

Abstract

The Universal Character Set (UCS) [ISO10646, Unicode] covers a very wide repertoire of characters. The IETF, in [RFC 2277], requires that future IETF protocols support UTF-8 [RFC 2279], an ASCII-compatible encoding of UCS. The wide range of characters included in the UCS has lead to some cases of duplicate encodings. This document proposes that in IETF protocols, the class of duplicates called canonical equivalents be dealt with by using Early Uniform Normalization according to Unicode Normalization Form C, Canonical Composition (NFC) [UTR15]. This document describes both Early Uniform Normalization and Normalization Form C.

Authors

Martin J. Dürst
Mark Davis

(Note: The e-mail addresses provided for the authors of this Internet-Draft may no longer be valid.)

Character Normalization in IETF Protocols draft-duerst-i18n-norm-04

Character Normalization in IETF Protocols
draft-duerst-i18n-norm-04