This is the mail archive of the cygwin mailing list for the Cygwin project.
Re: Non-trivial Cyrillic question
- From: Roman Belenov <rbelenov at yandex dot ru>
- To: cygwin at cygwin dot com
- Date: Tue, 20 Jan 2004 18:27:18 +0300
- Subject: Re: Non-trivial Cyrillic question
- References: <400D3F1B.4010504@contact.bg>
Alexander Shopov <ash@contact.bg> writes:
> However when I pipe it to less
>
> cat myfile | less
>
> it gets broken
> I get things like: <CF><D0><C5><CF><C8><D1><CA><C0>
You should tell less which characters are supposed to be printable,
using the LESSCHARSET or LESSCHARDEF environment variable (see man less
for details).
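
As a rough sketch of the above (untested with cp1251 here): the
LESSCHARDEF string below is the definition that man less lists for the
koi8-r charset, which marks every byte from 0x80 to 0xFF as printable.
Reusing it for cp1251 is my assumption. As far as I can tell less
checks LESSCHARSET first, hence the unset.

```shell
# Assumption: treat all bytes 0x80-0xFF as normal (printable) characters.
# Format: '.' = normal, 'c' = control, 'b' = binary; a number repeats
# the character class that follows it.
unset LESSCHARSET
LESSCHARDEF='8bcccbcc18b95.b128.'
export LESSCHARDEF
```

With this in the environment, "cat myfile | less" should show the
letters themselves instead of <CF><D0>... escapes, provided the
terminal font and code page can draw them.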
> ls *
>
> it is displayed like
> ????.txt
ls uses the current locale to determine which characters are
printable. AFAIK Cygwin isn't locale-aware (I may be mistaken); a
possible workaround is the --show-control-chars switch, which makes ls
print all characters literally.
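
To illustrate the difference, here is a small sketch assuming GNU ls.
The scratch directory and the four-byte file name are made up for the
demo; -q is given explicitly because it is the masking behaviour ls
applies by default when writing to a terminal.

```shell
# Hypothetical scratch directory with one file whose name starts with
# four cp1251 bytes (0xCF 0xD0 0xC5 0xCF); the name is invented.
demo_dir=$(mktemp -d)
name=$(printf '\317\320\305\317.txt')
: > "$demo_dir/$name"

# With -q in the C locale, non-ASCII bytes are shown as '?'.
masked=$(LC_ALL=C ls -q "$demo_dir")

# --show-control-chars asks ls to pass the bytes through unchanged.
literal=$(LC_ALL=C ls --show-control-chars "$demo_dir")

printf '%s\n' "$masked"
rm -rf "$demo_dir"
```

Whether the literal bytes then render as Cyrillic depends on the
console code page, which ls does not control.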
BTW, is cp1251 the only 8-bit encoding used by the Bulgarian locale in
MS Windows? In Russia two encodings are used: cp866 as OEM (used, e.g.,
in the console) and cp1251 as ANSI.
--
With regards, Roman.