This is the mail archive of the cygwin@cygwin.com mailing list for the Cygwin project.

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]
Other format:	[Raw text]

Re: Unicode files

From: Jon LaBadie <jcyg at jgcomp dot com>
To: cygwin at cygwin dot com
Date: Mon, 13 Jan 2003 02:44:25 -0500
Subject: Re: Unicode files
References: <F1B91355CC52D3119412004F4E0286AF78E309@msx1.link.domnet.com.au> <5.2.0.9.2.20030112210002.02720e50@pop3.cris.com>

At 21:18 2003-01-12, Clancy Malcolm wrote:
>Can cygwin programs like grep process a unicode file?
>
>I have a Windows 2000 backup log file which seems to be a unicode file.
>When I cat the file under cygwin it displays with spaces between every
>second character: e.g.
>
>ÿ_B a c k u p   S t a t u s
> O p e r a t i o n :   B a c k u p
> A c t i v e   b a c k u p   d e s t i n a t i o n :   4 m m   D D S
> M e d i a   n a m e :   " T S C 4 - 2 0 0 3 - 0 1 - 1 3 - 2 - 3 5 p "
>
>When I cat in under linux it displays as a normal text file.  I have a
>script which grep's for particular patterns and emails matching lines, but
>it won't work with these log files.
>
>Any suggestions greatly appreciated.

For the ntbtlog.txt file I found this worked as a conversion tool for me.

	tr -d '\0\r' < InputFile > OutputFile

But that is the only unicode file I've had to deal with.

-- 
Jon H. LaBadie                  jcyg@jgcomp.com
 JG Computing
 4455 Province Line Road        (609) 252-0159
 Princeton, NJ  08540-4322      (609) 683-7220 (fax)

--
Unsubscribe info:      http://cygwin.com/ml/#unsubscribe-simple
Bug reporting:         http://cygwin.com/bugs.html
Documentation:         http://cygwin.com/docs.html
FAQ:                   http://cygwin.com/faq/

References:
- Unicode files
  - From: Clancy Malcolm
- Re: Unicode files
  - From: Randall R Schulz

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]