[Clfs-commits] [CLFS Trac] #182: UTF-8
CLFS Trac
trac at cross-lfs.org
Mon Feb 2 11:11:50 PST 2009
#182: UTF-8
--------------------+-------------------------------------------------------
Reporter: ken | Owner: clfs-commits at lists.cross-lfs.org.
Type: task | Status: new
Priority: minor | Milestone: CLFS Standard 1.2.0
Component: BOOK | Version: CLFS Standard SVN
Resolution: | Keywords:
--------------------+-------------------------------------------------------
Comment (by ken):
I had 14 failures in glibc/localedata tests. Possibly related to the sed
for UTF-8, don't know. Haven't really looked at the other testsuites yet,
beyond a quick glance at gcc (failures in mudflap, i think - similar to
4.3.2) and binutils (more failures than before, probably because this is
x86_64-64). Maybe, if I can find the enthusiasm to rebuild the final
system without that sed, I might do so to see what changes in the tests.
The variables in the i18n script were mostly new to me - I've never
encountered people adding extra partial maps, I thought normal maps
already included things like the euro - but not a problem. I do have a
problem with defaulting to the 'windowkeys', but that isn't major and
easily altered by the user. I'll also note that my Compose key in my local
keymap doesn't seem to be working on this build - the dead keys for
latin1-compatible accents are still working, forgot to test the things
I've added to AltGr.
On to the main test of man pages:
Perhaps I'm missing something, e.g. maybe groff now needs some sort of
override or post-install configuration to use the new functionality ? For
the moment, my test results in everything except en_GB.UTF-8 are labelled
as "unusable".
I wanted to test the following areas:
possible unrendered text in the error message for 'man foo' because
man-1.6f provides localized messages in legacy encodings. I have some
(long) iconv invocations that I normally use to fix this.
possible unrendered text for 'man apropos' and the other pages supplied
with man-1.6f in legacy encodings. Again, I have some iconv invocations to
fix this.
man-pages for vim might need to be moved to be found (fr,it,pl,ru).
man-pages from shadow (typically, tested with passwd (5)) are supplied as
UTF-8. The scripts (ja, ko, zh_*) can't be rendered in the console, and in
the event I didn't think it was worthwhile to build xorg because of the
errors I found.
Results (testing with LC_ALL=whatever man something)
'gibberish' means the text rendered, typically as a mixture of accented
vowels and occasional
icelandic letters, but was clearly non-sensical for the language.
'invalid UTF-8' means one or more inverse '?' symbols, caused by invalid
UTF-8 characters arriving at the pager.
'<xx>' means the pager displayed these hex values in inverse video
These were all tested using my (UTF-8) sigma-general 'font', which can
handle the main latin and cyrillic languages as well as monotonic greek.
bg_BG.UTF-8 man foo invalid UTF-8, man 1 apropos gibberish
cs_CZ.UTF-8 man foo invalid UTF-8, man 1 apropos invalid UTF-8, man 5
passwd mix of invalid UTF-8 and gibberish
da_DK.UTF-8 man foo ok (all ascii), man 1 apropos ok
de_DE.UTF-8 man foo ok, man 1 apropos ok, man 5 passwd gibberish and
invalid UTF-8
el_GR.UTF-8 man foo invalid UTF-8, man 1 apropos gibberish
es_ES.UTF-8 man foo invalid UTF-8, man 1 apropos and man 5 passwd <C3><93>
etc
fi_FI.UTF-8 similar to es_ES.UTF-8
fr_FR.UTF-8 man foo ok (ascii), man 1 apropos, man 1 vim, man 5 passwd
<C3><A9> etc
interestingly, if I point man to fr_FR.ISO8859-1/man1/vim.1 it renders
correctly.
At this point I gave up. I guessI can fix the 'man foo' issue, but the
rest is so broken I don't know where to begin looking.
--
Ticket URL: <http://trac.cross-lfs.org/ticket/182#comment:16>
CLFS Trac <http://trac.cross-lfs.org>
The Cross Linux From Scratch Project.
More information about the Clfs-commits
mailing list