This is the mail archive of the
cygwin-apps@cygwin.com
mailing list for the Cygwin project.
[ITP] rxp-1.3.0 - a new package for review (XML parser)
- From: letters at hotpop dot com (Jari Aalto+mail.emacs)
- To: cygwin-apps at cygwin dot com
- Date: Thu, 14 Aug 2003 00:29:28 +0300
- Subject: [ITP] rxp-1.3.0 - a new package for review (XML parser)
Iäve found rxp easier to use than xmlwf (expat), especially when
rxp has more options for verification purposes. I'm attaching
the manual below for people interested in option differences to
expat.
Please review packages below.
Jari
http://tierra.dyndns.org:81/cygwin/rxp/rxp-1.3.0-1-src.tar.bz2
http://tierra.dyndns.org:81/cygwin/rxp/rxp-1.3.0-1.tar.bz2
sdesc: "RXP is simple validating XML parser written by Richard Tobin."
ldesc: "RXP is simple validating XML parser written by Richard Tobin. It supports XML 1.1, Namespaces 1.1, and XML Catalogs."
category: Devel, Doc, Interpreters, Text
requires: cygwin
+ tar jtvf rxp-1.3.0-1.tar.bz2
drwxr-xr-x root/None 0 2003-08-13 23:47:46 usr/
drwxr-xr-x root/None 0 2003-08-13 23:47:46 usr/doc/
drwxr-xr-x root/None 0 2003-08-13 23:47:46 usr/doc/rxp-1.3.0/
-rw-r--r-- root/None 17976 2003-08-13 23:47:46 usr/doc/rxp-1.3.0/COPYING
-rw-r--r-- root/None 570 2003-08-13 23:47:46 usr/doc/rxp-1.3.0/COPYRIGHT
-rw-r--r-- root/None 2042 2003-08-13 23:47:48 usr/doc/rxp-1.3.0/RELNOTES
drwxr-xr-x root/None 0 2003-08-13 23:47:50 usr/doc/Cygwin/
-rw-r--r-- root/None 1146 2003-08-13 23:47:50 usr/doc/Cygwin/rxp-1.3.0.README
drwxr-xr-x root/None 0 2003-08-13 23:47:50 usr/bin/
-rwxr-xr-x root/None 123904 2003-08-13 23:47:52 usr/bin/rxp.exe
drwxr-xr-x root/None 0 2003-08-13 23:47:50 usr/man/
drwxr-xr-x root/None 0 2003-08-13 23:47:50 usr/man/man1/
-rw-r--r-- root/None 3633 2003-08-13 23:47:50 usr/man/man1/rxp.1
NAME
rxp - XML parser program
SYNOPSIS
rxp [ -avVEsbnmtx ] [ -o b|p|0|1|2|3 ] [ -c encoding ] [ url ]
DESCRIPTION
rxp reads and parses XML from the url (or standard input if none is
provided) and writes it to standard output, optionally expanding enti-
ties, defaulting attributes, and translating to a different output
encoding.
OPTIONS
-a Insert declared default values for omitted attributes.
-v Be verbose.
-V Validate the document. Repeating this option will make the pro-
gram treat validity errors as well-formedness errors, and exit
after the first validity error (otherwise a warning will be
printed for each one).
-d Read the whole DTD (internal and external parts) regardless of
any standalone declaration. Otherwise a declaration "stan-
dalone='yes'" will prevent the external part from being read
(unless validation is selected).
-N Enable XML namespace support. The document will be checked for
correct namespace syntax, and if -b is specified qualified ele-
ment and attribute names will be displayed with their URIs.
-S Keep track of xml:space attributes. This will only affect out-
put when -b is specified.
-e Obsolete, do not use.
-E Do not expand entity references (opposite of old -e flag)
-s Be silent (that is, suppress output). Useful for benchmarking
or if you just want to see the error messages.
-b Print output as "bits".
-n Treat the input as normalised SGML rather than XML. Not
intended for general use.
-o If this flag is p, output is in the default (plain) format. If
it is b, output is printed as "bits" (equivalent to -b). If
it is 0, output is suppressed (equivalent to -s). If it is 1, 2
or 3, output is in first, second or third canonical form.
-m Merge PCData across entity references. This will only affect
the output when -b is specified.
-t Read in the input as a tree, rather than bits. Should make no
difference to the output.
-u base_uri
Use the specified base URI when resolving system identifiers.
-x Strict XML mode. This suppresses some warnings (eg entity
redefinitions) but treats all XML well-formedness errors as
fatal. This flag implies the -a flag, and sets the output
encoding to UTF-8 unless the -c flag is given. It sets the out-
put format to first canonical form unless the -o, -b or -s flag
is given.
-c encoding
Produce output in the specified character encoding. Known
encodings include ISO-8859-1, UTF-8, ISO-10646-UCS and UTF-16.
16-bit encoding names my be suffixed with -B or -L to specify
big- or little-endian byte order (the default is the host byte
order). If no -c or -x option is given, output is in the same
encoding as the input document.
-z Use a shorter format for error messages. Particularly useful
when using the parser in Emacs compilation mode, so that Emacs
can find the error location.
--
http://tiny-tools.sourceforge.net/
Swatch @time http://www.ryanthiessen.com/swatch/resources.htm
Convert @time http://www.mir.com.my/iTime/itime.htm