• R/O
  • SSH
  • HTTPS

tsukurimashou: Commit


Commit MetaInfo

Revision315 (tree)
Time2012-08-27 06:52:56
Authormskala

Log Message

final stuff for IDSgrep 0.3

Change Summary

Incremental Difference

--- trunk/idsgrep/idsgrep.tex (revision 314)
+++ trunk/idsgrep/idsgrep.tex (revision 315)
@@ -112,6 +112,8 @@
112112
113113 \maketitle
114114
115+\setcounter{page}{2}
116+
115117 \renewcommand{\cfttoctitlefont}{%
116118 \huge\sffamily\bfseries}
117119 \renewcommand{\cftaftertoctitle}{%
@@ -477,7 +479,7 @@
477479 covering multiple languages as part of a larger processing environment that
478480 also includes a version of XEmacs~\cite{XEmacs} modified to follow the
479481 principles of the UTF-2000 initiative~\cite{UTF2000}. It also has
480-connections to GlyphWiki~\cite{GlyphWiki} and the KAGE~\cite{KAGE} project.
482+connections to GlyphWiki~\cite{GlyphWiki}.
481483 These systems are documented primarily in Japanese; English-language
482484 documentation is sparse and not necessarily up to date.
483485
--- trunk/idsgrep/idsgrep.1.in (revision 314)
+++ trunk/idsgrep/idsgrep.1.in (revision 315)
@@ -62,6 +62,41 @@
6262 .
6363 .SH OPTIONS
6464 .TP
65+.BI \-c "FMT\fR,\fP " "\-\^\-cooking=" FMT
66+Specify a mode for cooking of input and output.
67+The
68+.I FMT
69+parameter may be
70+.BR raw ", " rawnc ", " ascii ", " cooked ", " indent ", "
71+or a string of up to 12 decimal digits.
72+By default, or in
73+.B raw
74+or
75+.B rawnc
76+mode,
77+.I @PACKAGE@
78+will write out for each matching tree the exact sequence of bytes that were
79+read and parsed to generate that tree.
80+In the case of invalid input (for instance, malformed UTF-8), that sequence
81+of bytes could be bad.
82+Even with perfectly valid input, the sequence could be unusual, because EIDS
83+syntax in general permits many different strings to have the same semantic
84+value.
85+Setting a non-raw mode for output cooking will cause
86+.I @PACKAGE@
87+to generate its own EIDS syntax, in a guaranteed consistent form, for each
88+matching tree.
89+The
90+.B rawnc
91+mode is identical to
92+.B raw
93+save that it disables canonicalization (ascii-to-symbolic) processing on
94+input.
95+That modification to the syntax is also available in conjunction with output
96+cooking by means of the advanced decimal-string syntax.
97+See the PDF documentation for details of
98+.BR -c "."
99+.TP
65100 .BI \-d "NAME\fR,\fP " "\-\^\-dictionary=" NAME
66101 Choose one or more dictionaries from the default dictionary directory, using
67102 shell glob pattern
@@ -328,8 +363,10 @@
328363 single-character forms.
329364 The idea is to provide human-readable pure-ASCII alternate forms for
330365 the IDCs and matching operators.
331-In the future there may be options to skip this transformation
332-on input, or perform its reverse on output.
366+The
367+.B -c
368+command-line option makes it possibile to skip this transformation on
369+input, or perform its inverse on output.
333370 The list of replacements is: (anything) to (?); .anywhere. to ...; [and] to
334371 [&]; [or] to [|]; .not. to .!.; .regex. to ./.; .equal. to .=.; [lr] to
335372 [<U+2FF0>]; [tb] to [<U+2FF1>]; {lcr} to {<U+2FF2>}; {tcb} to {<U+2FF3>};
@@ -552,11 +589,21 @@
552589 be.
553590 .PP
554591 PCRE, due to the use of C-strings in its API, cannot accept literal zero
555-bytes (<U+0000>) in matching patterns, in even though EIDS syntax would
592+bytes (<U+0000>) in matching patterns, even though EIDS syntax would
556593 otherwise allow them.
557594 Zero bytes are allowed in matching subjects, and may be matched
558595 by writing them in the pattern with PCRE's escape syntax instead of
559596 literally.
597+.PP
598+If used on a system that permits non-UTF-8 byte sequences as filenames, it
599+is possible that
600+.I @PACKAGE@
601+may fail when it attempts to write out such a filename, because of
602+trying to pass the filename through escape-generation logic designed for
603+UTF-8.
604+This issue is hard to test because all modern systems use UTF-8 or some
605+restriction thereof for filenames, but it will probably be revisited, and
606+if it's a problem, fixed, in some future version.
560607 .
561608 .SH AUTHOR
562609 Matthew Skala <mskala@ansuz.sooke.bc.ca>
--- trunk/idsgrep/configure.ac (revision 314)
+++ trunk/idsgrep/configure.ac (revision 315)
@@ -159,7 +159,7 @@
159159 #
160160 AC_PREREQ([2.63])
161161 AC_INIT([IDSgrep],
162- [0.3pre], [mskala@ansuz.sooke.bc.ca], [idsgrep],
162+ [0.3], [mskala@ansuz.sooke.bc.ca], [idsgrep],
163163 [[http://tsukurimashou.sourceforge.jp/]])
164164 AC_PRESERVE_HELP_ORDER
165165 AM_INIT_AUTOMAKE([foreign parallel-tests color-tests])
@@ -168,7 +168,7 @@
168168 AC_CONFIG_MACRO_DIR([m4])
169169 AC_REVISION([$Id: configure.ac 1015 2011-12-15 22:24:32Z mskala $])
170170 AC_COPYRIGHT([Copyright (C) 2012 Matthew Skala])
171-AC_SUBST([release_date],["August 1, 2012"])
171+AC_SUBST([release_date],["August 26, 2012"])
172172 #
173173 ############################################################################
174174 #
Show on old repository browser