     RUNECLASS(2)                                         RUNECLASS(2)

     NAME
          runeclass, runecompose - Unicode character equivalence

     SYNOPSIS
          #include <u.h>
          #include <libc.h>

          Rune *runeclass(Rune r)

          int runecompose(Rune base, Rune combiner)

     DESCRIPTION
          These routines use codepoint properties from the Unicode
          standard to compose characters and to determine the set of
          characters that share a base character.  The set of
          codepoints with the same base codepoint is a generalization
          of the equivalence between cases such as title, lower and
          upper: runeclass returns this set.  Likewise, runecompose
          takes a base codepoint and a combining codepoint (e.g.
          U+0308, combining diaeresis) and returns the combined form,
          if it exists.  For example

                    echo e0308 | rune/uconv | rune/compose
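
          The pairwise lookup that runecompose is described as
          performing can be sketched in portable C.  This is only an
          illustration, not the library's implementation: the Rune
          typedef is a stand-in for the Plan 9 type, compose_sketch
          and the pairs table are hypothetical names, the table holds
          just a few well-known compositions, and the -1 result for a
          missing pair is an assumption, since this page does not say
          what runecompose returns when no combined form exists.

```c
#include <stddef.h>
#include <stdint.h>

/* Stand-in for the Plan 9 Rune type so the sketch builds elsewhere. */
typedef uint32_t Rune;

/* One composition: base + combiner -> composed. */
typedef struct {
	Rune base;
	Rune combiner;
	Rune composed;
} Pair;

/* A few well-known pairs only; the Unicode data covers far more. */
static const Pair pairs[] = {
	{ 'a', 0x0308, 0x00E4 },	/* a with diaeresis */
	{ 'e', 0x0308, 0x00EB },	/* e with diaeresis */
	{ 'o', 0x0308, 0x00F6 },	/* o with diaeresis */
	{ 'u', 0x0308, 0x00FC },	/* u with diaeresis */
	{ 'e', 0x0301, 0x00E9 },	/* e with acute */
};

/*
 * Hypothetical helper, not the library routine: return the composed
 * codepoint, or -1 if the table has no entry for the pair.  The -1
 * convention is an assumption made for this sketch.
 */
int
compose_sketch(Rune base, Rune combiner)
{
	size_t i;

	for(i = 0; i < sizeof pairs / sizeof pairs[0]; i++)
		if(pairs[i].base == base && pairs[i].combiner == combiner)
			return (int)pairs[i].composed;
	return -1;
}
```

          With this table, compose_sketch('e', 0x0308) yields 0x00EB,
          while a pair that is not listed, such as 'x' with 0x0308,
          yields -1.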

     SEE ALSO
          grep(1), rune(1),
          The Unicode Consortium.  The Unicode Standard, Version
          6.0.0, (Mountain View, CA: The Unicode Consortium, 2011.
          ISBN 978-1-936213-01-6)
          http://www.unicode.org/versions/Unicode6.0.0/