This is the mail archive of the cygwin@cygwin.com mailing list for the Cygwin project.
Re: Unicode files
At 21:18 2003-01-12, Clancy Malcolm wrote:
>Can Cygwin programs like grep process a Unicode file?
>
>I have a Windows 2000 backup log file which seems to be a Unicode file.
>When I cat the file under Cygwin it displays with a space after every
>character, e.g.:
>
>ÿ_B a c k u p S t a t u s
> O p e r a t i o n : B a c k u p
> A c t i v e b a c k u p d e s t i n a t i o n : 4 m m D D S
> M e d i a n a m e : " T S C 4 - 2 0 0 3 - 0 1 - 1 3 - 2 - 3 5 p "
>
>When I cat it under Linux it displays as a normal text file. I have a
>script which greps for particular patterns and emails matching lines, but
>it won't work with these log files.
>
>Any suggestions greatly appreciated.
For the ntbtlog.txt file, I found that this worked as a conversion tool:
tr -d '\0\r' < InputFile > OutputFile
But that is the only Unicode file I've had to deal with.
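
To illustrate why the tr command above helps, here is a minimal sketch. It assumes the log is UTF-16 little-endian text containing only ASCII-range characters, so each character is stored as its ASCII byte followed by a NUL byte. The file names (sample_utf16.txt, sample_tr.txt, sample_clean.txt) are made up for the demonstration, and the tail step is an addition, not part of the original suggestion.

```shell
# Build a tiny UTF-16LE sample: byte order mark (FF FE), then "Hi\r\n"
# with a NUL byte after each character. File names are hypothetical.
printf '\377\376H\000i\000\r\000\n\000' > sample_utf16.txt

# The command from the message: delete NUL bytes and carriage returns.
tr -d '\0\r' < sample_utf16.txt > sample_tr.txt

# The two byte-order-mark bytes survive tr, so skip the first two bytes
# if the first line has to match cleanly (assumes a BOM is present).
tail -c +3 sample_tr.txt > sample_clean.txt

# grep now sees ordinary ASCII text.
grep -c 'Hi' sample_clean.txt   # prints 1
```

This only works while every character fits in the ASCII range. Where iconv is installed, something like iconv -f UTF-16 -t UTF-8 is probably the more general route, since it converts the characters instead of discarding bytes.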
--
Jon H. LaBadie jcyg@jgcomp.com
JG Computing
4455 Province Line Road (609) 252-0159
Princeton, NJ 08540-4322 (609) 683-7220 (fax)