[Groonga-commit] ranguba/chupa-text-decomposer-abiword at 8031b88 [master] Import

Back to archive index
Sutou Kouhei null+****@clear*****
Thu Jun 13 16:21:09 JST 2019


Sutou Kouhei	2019-06-13 16:21:09 +0900 (Thu, 13 Jun 2019)

  Revision: 8031b8892ec8286ac20aacf97695e864714c3d23
  https://github.com/ranguba/chupa-text-decomposer-abiword/commit/8031b8892ec8286ac20aacf97695e864714c3d23

  Message:
    Import

  Added files:
    .gitignore
    .travis.yml
    .yardopts
    Gemfile
    LICENSE.txt
    README.md
    Rakefile
    chupa-text-decomposer-abiword.gemspec
    doc/text/news.md
    lib/chupa-text/decomposers/abiword.rb
    test/fixture/abw/multi-pages.abw
    test/fixture/abw/one-page.abw
    test/fixture/doc/multi-pages.doc
    test/fixture/doc/one-page.doc
    test/fixture/docx/multi-pages.docx
    test/fixture/docx/one-page.docx
    test/fixture/odt/multi-pages.odt
    test/fixture/odt/one-page.odt
    test/fixture/rtf/multi-pages.rtf
    test/fixture/rtf/one-page.rtf
    test/fixture/zabw/multi-pages.zabw
    test/fixture/zabw/one-page.zabw
    test/helper.rb
    test/run-test.rb
    test/test-abw.rb
    test/test-doc.rb
    test/test-docx.rb
    test/test-odt.rb
    test/test-rtf.rb
    test/test-zabw.rb

  Added: .gitignore (+4 -0) 100644
===================================================================
--- /dev/null
+++ .gitignore    2019-06-13 16:21:09 +0900 (327ad4d)
@@ -0,0 +1,4 @@
+/doc/reference/
+/.yardoc/
+/pkg/
+/Gemfile.lock

  Added: .travis.yml (+12 -0) 100644
===================================================================
--- /dev/null
+++ .travis.yml    2019-06-13 16:21:09 +0900 (5cbcb34)
@@ -0,0 +1,12 @@
+notifications:
+  webhooks:
+    - https://webhook.commit-email.info/
+dist: xenial
+addons:
+  apt:
+    packages:
+      - abiword
+rvm:
+  - 2.4
+  - 2.5
+  - 2.6

  Added: .yardopts (+5 -0) 100644
===================================================================
--- /dev/null
+++ .yardopts    2019-06-13 16:21:09 +0900 (7861910)
@@ -0,0 +1,5 @@
+--output-dir doc/reference/en
+--markup markdown
+--markup-provider kramdown
+-
+doc/text/*

  Added: Gemfile (+27 -0) 100644
===================================================================
--- /dev/null
+++ Gemfile    2019-06-13 16:21:09 +0900 (edbc737)
@@ -0,0 +1,27 @@
+# -*- ruby -*-
+#
+# Copyright (C) 2019  Sutou Kouhei <kou****@clear*****>
+#
+# This library is free software; you can redistribute it and/or
+# modify it under the terms of the GNU Lesser General Public
+# License as published by the Free Software Foundation; either
+# version 2.1 of the License, or (at your option) any later version.
+#
+# This library is distributed in the hope that it will be useful,
+# but WITHOUT ANY WARRANTY; without even the implied warranty of
+# MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the GNU
+# Lesser General Public License for more details.
+#
+# You should have received a copy of the GNU Lesser General Public
+# License along with this library; if not, write to the Free Software
+# Foundation, Inc., 51 Franklin Street, Fifth Floor, Boston, MA  02110-1301  USA
+
+source "https://rubygems.org/"
+
+gemspec
+
+base_dir = File.dirname(__FILE__)
+local_chupa_text_dir = File.join(base_dir, "..", "chupa-text")
+if File.exist?(local_chupa_text_dir)
+  gem "chupa-text", :path => local_chupa_text_dir
+end

  Added: LICENSE.txt (+502 -0) 100644
===================================================================
--- /dev/null
+++ LICENSE.txt    2019-06-13 16:21:09 +0900 (4362b49)
@@ -0,0 +1,502 @@
+                  GNU LESSER GENERAL PUBLIC LICENSE
+                       Version 2.1, February 1999
+
+ Copyright (C) 1991, 1999 Free Software Foundation, Inc.
+ 51 Franklin Street, Fifth Floor, Boston, MA  02110-1301  USA
+ Everyone is permitted to copy and distribute verbatim copies
+ of this license document, but changing it is not allowed.
+
+[This is the first released version of the Lesser GPL.  It also counts
+ as the successor of the GNU Library Public License, version 2, hence
+ the version number 2.1.]
+
+                            Preamble
+
+  The licenses for most software are designed to take away your
+freedom to share and change it.  By contrast, the GNU General Public
+Licenses are intended to guarantee your freedom to share and change
+free software--to make sure the software is free for all its users.
+
+  This license, the Lesser General Public License, applies to some
+specially designated software packages--typically libraries--of the
+Free Software Foundation and other authors who decide to use it.  You
+can use it too, but we suggest you first think carefully about whether
+this license or the ordinary General Public License is the better
+strategy to use in any particular case, based on the explanations below.
+
+  When we speak of free software, we are referring to freedom of use,
+not price.  Our General Public Licenses are designed to make sure that
+you have the freedom to distribute copies of free software (and charge
+for this service if you wish); that you receive source code or can get
+it if you want it; that you can change the software and use pieces of
+it in new free programs; and that you are informed that you can do
+these things.
+
+  To protect your rights, we need to make restrictions that forbid
+distributors to deny you these rights or to ask you to surrender these
+rights.  These restrictions translate to certain responsibilities for
+you if you distribute copies of the library or if you modify it.
+
+  For example, if you distribute copies of the library, whether gratis
+or for a fee, you must give the recipients all the rights that we gave
+you.  You must make sure that they, too, receive or can get the source
+code.  If you link other code with the library, you must provide
+complete object files to the recipients, so that they can relink them
+with the library after making changes to the library and recompiling
+it.  And you must show them these terms so they know their rights.
+
+  We protect your rights with a two-step method: (1) we copyright the
+library, and (2) we offer you this license, which gives you legal
+permission to copy, distribute and/or modify the library.
+
+  To protect each distributor, we want to make it very clear that
+there is no warranty for the free library.  Also, if the library is
+modified by someone else and passed on, the recipients should know
+that what they have is not the original version, so that the original
+author's reputation will not be affected by problems that might be
+introduced by others.
+
+  Finally, software patents pose a constant threat to the existence of
+any free program.  We wish to make sure that a company cannot
+effectively restrict the users of a free program by obtaining a
+restrictive license from a patent holder.  Therefore, we insist that
+any patent license obtained for a version of the library must be
+consistent with the full freedom of use specified in this license.
+
+  Most GNU software, including some libraries, is covered by the
+ordinary GNU General Public License.  This license, the GNU Lesser
+General Public License, applies to certain designated libraries, and
+is quite different from the ordinary General Public License.  We use
+this license for certain libraries in order to permit linking those
+libraries into non-free programs.
+
+  When a program is linked with a library, whether statically or using
+a shared library, the combination of the two is legally speaking a
+combined work, a derivative of the original library.  The ordinary
+General Public License therefore permits such linking only if the
+entire combination fits its criteria of freedom.  The Lesser General
+Public License permits more lax criteria for linking other code with
+the library.
+
+  We call this license the "Lesser" General Public License because it
+does Less to protect the user's freedom than the ordinary General
+Public License.  It also provides other free software developers Less
+of an advantage over competing non-free programs.  These disadvantages
+are the reason we use the ordinary General Public License for many
+libraries.  However, the Lesser license provides advantages in certain
+special circumstances.
+
+  For example, on rare occasions, there may be a special need to
+encourage the widest possible use of a certain library, so that it becomes
+a de-facto standard.  To achieve this, non-free programs must be
+allowed to use the library.  A more frequent case is that a free
+library does the same job as widely used non-free libraries.  In this
+case, there is little to gain by limiting the free library to free
+software only, so we use the Lesser General Public License.
+
+  In other cases, permission to use a particular library in non-free
+programs enables a greater number of people to use a large body of
+free software.  For example, permission to use the GNU C Library in
+non-free programs enables many more people to use the whole GNU
+operating system, as well as its variant, the GNU/Linux operating
+system.
+
+  Although the Lesser General Public License is Less protective of the
+users' freedom, it does ensure that the user of a program that is
+linked with the Library has the freedom and the wherewithal to run
+that program using a modified version of the Library.
+
+  The precise terms and conditions for copying, distribution and
+modification follow.  Pay close attention to the difference between a
+"work based on the library" and a "work that uses the library".  The
+former contains code derived from the library, whereas the latter must
+be combined with the library in order to run.
+
+                  GNU LESSER GENERAL PUBLIC LICENSE
+   TERMS AND CONDITIONS FOR COPYING, DISTRIBUTION AND MODIFICATION
+
+  0. This License Agreement applies to any software library or other
+program which contains a notice placed by the copyright holder or
+other authorized party saying it may be distributed under the terms of
+this Lesser General Public License (also called "this License").
+Each licensee is addressed as "you".
+
+  A "library" means a collection of software functions and/or data
+prepared so as to be conveniently linked with application programs
+(which use some of those functions and data) to form executables.
+
+  The "Library", below, refers to any such software library or work
+which has been distributed under these terms.  A "work based on the
+Library" means either the Library or any derivative work under
+copyright law: that is to say, a work containing the Library or a
+portion of it, either verbatim or with modifications and/or translated
+straightforwardly into another language.  (Hereinafter, translation is
+included without limitation in the term "modification".)
+
+  "Source code" for a work means the preferred form of the work for
+making modifications to it.  For a library, complete source code means
+all the source code for all modules it contains, plus any associated
+interface definition files, plus the scripts used to control compilation
+and installation of the library.
+
+  Activities other than copying, distribution and modification are not
+covered by this License; they are outside its scope.  The act of
+running a program using the Library is not restricted, and output from
+such a program is covered only if its contents constitute a work based
+on the Library (independent of the use of the Library in a tool for
+writing it).  Whether that is true depends on what the Library does
+and what the program that uses the Library does.
+
+  1. You may copy and distribute verbatim copies of the Library's
+complete source code as you receive it, in any medium, provided that
+you conspicuously and appropriately publish on each copy an
+appropriate copyright notice and disclaimer of warranty; keep intact
+all the notices that refer to this License and to the absence of any
+warranty; and distribute a copy of this License along with the
+Library.
+
+  You may charge a fee for the physical act of transferring a copy,
+and you may at your option offer warranty protection in exchange for a
+fee.
+
+  2. You may modify your copy or copies of the Library or any portion
+of it, thus forming a work based on the Library, and copy and
+distribute such modifications or work under the terms of Section 1
+above, provided that you also meet all of these conditions:
+
+    a) The modified work must itself be a software library.
+
+    b) You must cause the files modified to carry prominent notices
+    stating that you changed the files and the date of any change.
+
+    c) You must cause the whole of the work to be licensed at no
+    charge to all third parties under the terms of this License.
+
+    d) If a facility in the modified Library refers to a function or a
+    table of data to be supplied by an application program that uses
+    the facility, other than as an argument passed when the facility
+    is invoked, then you must make a good faith effort to ensure that,
+    in the event an application does not supply such function or
+    table, the facility still operates, and performs whatever part of
+    its purpose remains meaningful.
+
+    (For example, a function in a library to compute square roots has
+    a purpose that is entirely well-defined independent of the
+    application.  Therefore, Subsection 2d requires that any
+    application-supplied function or table used by this function must
+    be optional: if the application does not supply it, the square
+    root function must still compute square roots.)
+
+These requirements apply to the modified work as a whole.  If
+identifiable sections of that work are not derived from the Library,
+and can be reasonably considered independent and separate works in
+themselves, then this License, and its terms, do not apply to those
+sections when you distribute them as separate works.  But when you
+distribute the same sections as part of a whole which is a work based
+on the Library, the distribution of the whole must be on the terms of
+this License, whose permissions for other licensees extend to the
+entire whole, and thus to each and every part regardless of who wrote
+it.
+
+Thus, it is not the intent of this section to claim rights or contest
+your rights to work written entirely by you; rather, the intent is to
+exercise the right to control the distribution of derivative or
+collective works based on the Library.
+
+In addition, mere aggregation of another work not based on the Library
+with the Library (or with a work based on the Library) on a volume of
+a storage or distribution medium does not bring the other work under
+the scope of this License.
+
+  3. You may opt to apply the terms of the ordinary GNU General Public
+License instead of this License to a given copy of the Library.  To do
+this, you must alter all the notices that refer to this License, so
+that they refer to the ordinary GNU General Public License, version 2,
+instead of to this License.  (If a newer version than version 2 of the
+ordinary GNU General Public License has appeared, then you can specify
+that version instead if you wish.)  Do not make any other change in
+these notices.
+
+  Once this change is made in a given copy, it is irreversible for
+that copy, so the ordinary GNU General Public License applies to all
+subsequent copies and derivative works made from that copy.
+
+  This option is useful when you wish to copy part of the code of
+the Library into a program that is not a library.
+
+  4. You may copy and distribute the Library (or a portion or
+derivative of it, under Section 2) in object code or executable form
+under the terms of Sections 1 and 2 above provided that you accompany
+it with the complete corresponding machine-readable source code, which
+must be distributed under the terms of Sections 1 and 2 above on a
+medium customarily used for software interchange.
+
+  If distribution of object code is made by offering access to copy
+from a designated place, then offering equivalent access to copy the
+source code from the same place satisfies the requirement to
+distribute the source code, even though third parties are not
+compelled to copy the source along with the object code.
+
+  5. A program that contains no derivative of any portion of the
+Library, but is designed to work with the Library by being compiled or
+linked with it, is called a "work that uses the Library".  Such a
+work, in isolation, is not a derivative work of the Library, and
+therefore falls outside the scope of this License.
+
+  However, linking a "work that uses the Library" with the Library
+creates an executable that is a derivative of the Library (because it
+contains portions of the Library), rather than a "work that uses the
+library".  The executable is therefore covered by this License.
+Section 6 states terms for distribution of such executables.
+
+  When a "work that uses the Library" uses material from a header file
+that is part of the Library, the object code for the work may be a
+derivative work of the Library even though the source code is not.
+Whether this is true is especially significant if the work can be
+linked without the Library, or if the work is itself a library.  The
+threshold for this to be true is not precisely defined by law.
+
+  If such an object file uses only numerical parameters, data
+structure layouts and accessors, and small macros and small inline
+functions (ten lines or less in length), then the use of the object
+file is unrestricted, regardless of whether it is legally a derivative
+work.  (Executables containing this object code plus portions of the
+Library will still fall under Section 6.)
+
+  Otherwise, if the work is a derivative of the Library, you may
+distribute the object code for the work under the terms of Section 6.
+Any executables containing that work also fall under Section 6,
+whether or not they are linked directly with the Library itself.
+
+  6. As an exception to the Sections above, you may also combine or
+link a "work that uses the Library" with the Library to produce a
+work containing portions of the Library, and distribute that work
+under terms of your choice, provided that the terms permit
+modification of the work for the customer's own use and reverse
+engineering for debugging such modifications.
+
+  You must give prominent notice with each copy of the work that the
+Library is used in it and that the Library and its use are covered by
+this License.  You must supply a copy of this License.  If the work
+during execution displays copyright notices, you must include the
+copyright notice for the Library among them, as well as a reference
+directing the user to the copy of this License.  Also, you must do one
+of these things:
+
+    a) Accompany the work with the complete corresponding
+    machine-readable source code for the Library including whatever
+    changes were used in the work (which must be distributed under
+    Sections 1 and 2 above); and, if the work is an executable linked
+    with the Library, with the complete machine-readable "work that
+    uses the Library", as object code and/or source code, so that the
+    user can modify the Library and then relink to produce a modified
+    executable containing the modified Library.  (It is understood
+    that the user who changes the contents of definitions files in the
+    Library will not necessarily be able to recompile the application
+    to use the modified definitions.)
+
+    b) Use a suitable shared library mechanism for linking with the
+    Library.  A suitable mechanism is one that (1) uses at run time a
+    copy of the library already present on the user's computer system,
+    rather than copying library functions into the executable, and (2)
+    will operate properly with a modified version of the library, if
+    the user installs one, as long as the modified version is
+    interface-compatible with the version that the work was made with.
+
+    c) Accompany the work with a written offer, valid for at
+    least three years, to give the same user the materials
+    specified in Subsection 6a, above, for a charge no more
+    than the cost of performing this distribution.
+
+    d) If distribution of the work is made by offering access to copy
+    from a designated place, offer equivalent access to copy the above
+    specified materials from the same place.
+
+    e) Verify that the user has already received a copy of these
+    materials or that you have already sent this user a copy.
+
+  For an executable, the required form of the "work that uses the
+Library" must include any data and utility programs needed for
+reproducing the executable from it.  However, as a special exception,
+the materials to be distributed need not include anything that is
+normally distributed (in either source or binary form) with the major
+components (compiler, kernel, and so on) of the operating system on
+which the executable runs, unless that component itself accompanies
+the executable.
+
+  It may happen that this requirement contradicts the license
+restrictions of other proprietary libraries that do not normally
+accompany the operating system.  Such a contradiction means you cannot
+use both them and the Library together in an executable that you
+distribute.
+
+  7. You may place library facilities that are a work based on the
+Library side-by-side in a single library together with other library
+facilities not covered by this License, and distribute such a combined
+library, provided that the separate distribution of the work based on
+the Library and of the other library facilities is otherwise
+permitted, and provided that you do these two things:
+
+    a) Accompany the combined library with a copy of the same work
+    based on the Library, uncombined with any other library
+    facilities.  This must be distributed under the terms of the
+    Sections above.
+
+    b) Give prominent notice with the combined library of the fact
+    that part of it is a work based on the Library, and explaining
+    where to find the accompanying uncombined form of the same work.
+
+  8. You may not copy, modify, sublicense, link with, or distribute
+the Library except as expressly provided under this License.  Any
+attempt otherwise to copy, modify, sublicense, link with, or
+distribute the Library is void, and will automatically terminate your
+rights under this License.  However, parties who have received copies,
+or rights, from you under this License will not have their licenses
+terminated so long as such parties remain in full compliance.
+
+  9. You are not required to accept this License, since you have not
+signed it.  However, nothing else grants you permission to modify or
+distribute the Library or its derivative works.  These actions are
+prohibited by law if you do not accept this License.  Therefore, by
+modifying or distributing the Library (or any work based on the
+Library), you indicate your acceptance of this License to do so, and
+all its terms and conditions for copying, distributing or modifying
+the Library or works based on it.
+
+  10. Each time you redistribute the Library (or any work based on the
+Library), the recipient automatically receives a license from the
+original licensor to copy, distribute, link with or modify the Library
+subject to these terms and conditions.  You may not impose any further
+restrictions on the recipients' exercise of the rights granted herein.
+You are not responsible for enforcing compliance by third parties with
+this License.
+
+  11. If, as a consequence of a court judgment or allegation of patent
+infringement or for any other reason (not limited to patent issues),
+conditions are imposed on you (whether by court order, agreement or
+otherwise) that contradict the conditions of this License, they do not
+excuse you from the conditions of this License.  If you cannot
+distribute so as to satisfy simultaneously your obligations under this
+License and any other pertinent obligations, then as a consequence you
+may not distribute the Library at all.  For example, if a patent
+license would not permit royalty-free redistribution of the Library by
+all those who receive copies directly or indirectly through you, then
+the only way you could satisfy both it and this License would be to
+refrain entirely from distribution of the Library.
+
+If any portion of this section is held invalid or unenforceable under any
+particular circumstance, the balance of the section is intended to apply,
+and the section as a whole is intended to apply in other circumstances.
+
+It is not the purpose of this section to induce you to infringe any
+patents or other property right claims or to contest validity of any
+such claims; this section has the sole purpose of protecting the
+integrity of the free software distribution system which is
+implemented by public license practices.  Many people have made
+generous contributions to the wide range of software distributed
+through that system in reliance on consistent application of that
+system; it is up to the author/donor to decide if he or she is willing
+to distribute software through any other system and a licensee cannot
+impose that choice.
+
+This section is intended to make thoroughly clear what is believed to
+be a consequence of the rest of this License.
+
+  12. If the distribution and/or use of the Library is restricted in
+certain countries either by patents or by copyrighted interfaces, the
+original copyright holder who places the Library under this License may add
+an explicit geographical distribution limitation excluding those countries,
+so that distribution is permitted only in or among countries not thus
+excluded.  In such case, this License incorporates the limitation as if
+written in the body of this License.
+
+  13. The Free Software Foundation may publish revised and/or new
+versions of the Lesser General Public License from time to time.
+Such new versions will be similar in spirit to the present version,
+but may differ in detail to address new problems or concerns.
+
+Each version is given a distinguishing version number.  If the Library
+specifies a version number of this License which applies to it and
+"any later version", you have the option of following the terms and
+conditions either of that version or of any later version published by
+the Free Software Foundation.  If the Library does not specify a
+license version number, you may choose any version ever published by
+the Free Software Foundation.
+
+  14. If you wish to incorporate parts of the Library into other free
+programs whose distribution conditions are incompatible with these,
+write to the author to ask for permission.  For software which is
+copyrighted by the Free Software Foundation, write to the Free
+Software Foundation; we sometimes make exceptions for this.  Our
+decision will be guided by the two goals of preserving the free status
+of all derivatives of our free software and of promoting the sharing
+and reuse of software generally.
+
+                            NO WARRANTY
+
+  15. BECAUSE THE LIBRARY IS LICENSED FREE OF CHARGE, THERE IS NO
+WARRANTY FOR THE LIBRARY, TO THE EXTENT PERMITTED BY APPLICABLE LAW.
+EXCEPT WHEN OTHERWISE STATED IN WRITING THE COPYRIGHT HOLDERS AND/OR
+OTHER PARTIES PROVIDE THE LIBRARY "AS IS" WITHOUT WARRANTY OF ANY
+KIND, EITHER EXPRESSED OR IMPLIED, INCLUDING, BUT NOT LIMITED TO, THE
+IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR
+PURPOSE.  THE ENTIRE RISK AS TO THE QUALITY AND PERFORMANCE OF THE
+LIBRARY IS WITH YOU.  SHOULD THE LIBRARY PROVE DEFECTIVE, YOU ASSUME
+THE COST OF ALL NECESSARY SERVICING, REPAIR OR CORRECTION.
+
+  16. IN NO EVENT UNLESS REQUIRED BY APPLICABLE LAW OR AGREED TO IN
+WRITING WILL ANY COPYRIGHT HOLDER, OR ANY OTHER PARTY WHO MAY MODIFY
+AND/OR REDISTRIBUTE THE LIBRARY AS PERMITTED ABOVE, BE LIABLE TO YOU
+FOR DAMAGES, INCLUDING ANY GENERAL, SPECIAL, INCIDENTAL OR
+CONSEQUENTIAL DAMAGES ARISING OUT OF THE USE OR INABILITY TO USE THE
+LIBRARY (INCLUDING BUT NOT LIMITED TO LOSS OF DATA OR DATA BEING
+RENDERED INACCURATE OR LOSSES SUSTAINED BY YOU OR THIRD PARTIES OR A
+FAILURE OF THE LIBRARY TO OPERATE WITH ANY OTHER SOFTWARE), EVEN IF
+SUCH HOLDER OR OTHER PARTY HAS BEEN ADVISED OF THE POSSIBILITY OF SUCH
+DAMAGES.
+
+                     END OF TERMS AND CONDITIONS
+
+           How to Apply These Terms to Your New Libraries
+
+  If you develop a new library, and you want it to be of the greatest
+possible use to the public, we recommend making it free software that
+everyone can redistribute and change.  You can do so by permitting
+redistribution under these terms (or, alternatively, under the terms of the
+ordinary General Public License).
+
+  To apply these terms, attach the following notices to the library.  It is
+safest to attach them to the start of each source file to most effectively
+convey the exclusion of warranty; and each file should have at least the
+"copyright" line and a pointer to where the full notice is found.
+
+    <one line to give the library's name and a brief idea of what it does.>
+    Copyright (C) <year>  <name of author>
+
+    This library is free software; you can redistribute it and/or
+    modify it under the terms of the GNU Lesser General Public
+    License as published by the Free Software Foundation; either
+    version 2.1 of the License, or (at your option) any later version.
+
+    This library is distributed in the hope that it will be useful,
+    but WITHOUT ANY WARRANTY; without even the implied warranty of
+    MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the GNU
+    Lesser General Public License for more details.
+
+    You should have received a copy of the GNU Lesser General Public
+    License along with this library; if not, write to the Free Software
+    Foundation, Inc., 51 Franklin Street, Fifth Floor, Boston, MA  02110-1301  USA
+
+Also add information on how to contact you by electronic and paper mail.
+
+You should also get your employer (if you work as a programmer) or your
+school, if any, to sign a "copyright disclaimer" for the library, if
+necessary.  Here is a sample; alter the names:
+
+  Yoyodyne, Inc., hereby disclaims all copyright interest in the
+  library `Frob' (a library for tweaking knobs) written by James Random Hacker.
+
+  <signature of Ty Coon>, 1 April 1990
+  Ty Coon, President of Vice
+
+That's all there is to it!

  Added: README.md (+38 -0) 100644
===================================================================
--- /dev/null
+++ README.md    2019-06-13 16:21:09 +0900 (af52ba4)
@@ -0,0 +1,38 @@
+# README
+
+## Name
+
+chupa-text-decomposer-abiword
+
+## Description
+
+This is a ChupaText decomposer plugin for to extract text and
+meta-data from office documents such as Microsoft Word files and
+LibreOffice Writer files.
+
+You can use `abiword` decomposer.
+
+## Install
+
+Install chupa-text-decomposer-abiword gem:
+
+```
+% gem install chupa-text-decomposer-abiword
+```
+
+Now, you can extract text and meta-data from office documents:
+
+```
+% chupa-text document.doc
+```
+
+## Author
+
+  * Sutou Kouhei `<kou****@clear*****>`
+
+## License
+
+LGPL 2.1 or later.
+
+(Sutou Kouhei has a right to change the license including contributed
+patches.)

  Added: Rakefile (+48 -0) 100644
===================================================================
--- /dev/null
+++ Rakefile    2019-06-13 16:21:09 +0900 (1225f9b)
@@ -0,0 +1,48 @@
+# -*- ruby -*-
+#
+# Copyright (C) 2019  Sutou Kouhei <kou****@clear*****>
+#
+# This library is free software; you can redistribute it and/or
+# modify it under the terms of the GNU Lesser General Public
+# License as published by the Free Software Foundation; either
+# version 2.1 of the License, or (at your option) any later version.
+#
+# This library is distributed in the hope that it will be useful,
+# but WITHOUT ANY WARRANTY; without even the implied warranty of
+# MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the GNU
+# Lesser General Public License for more details.
+#
+# You should have received a copy of the GNU Lesser General Public
+# License along with this library; if not, write to the Free Software
+# Foundation, Inc., 51 Franklin Street, Fifth Floor, Boston, MA  02110-1301  USA
+
+task :default => :test
+
+require "pathname"
+
+require "rubygems"
+require "bundler/gem_helper"
+require "packnga"
+
+base_dir = Pathname(__FILE__).dirname
+
+helper = Bundler::GemHelper.new(base_dir.to_s)
+def helper.version_tag
+  version
+end
+
+helper.install
+spec = helper.gemspec
+
+Packnga::DocumentTask.new(spec) do |task|
+  task.original_language = "en"
+  task.translate_language = "ja"
+end
+
+Packnga::ReleaseTask.new(spec) do
+end
+
+desc "Run tests"
+task :test do
+  ruby("test/run-test.rb")
+end

  Added: chupa-text-decomposer-abiword.gemspec (+50 -0) 100644
===================================================================
--- /dev/null
+++ chupa-text-decomposer-abiword.gemspec    2019-06-13 16:21:09 +0900 (8f30aa0)
@@ -0,0 +1,50 @@
+# -*- ruby -*-
+#
+# Copyright (C) 2019  Sutou Kouhei <kou****@clear*****>
+#
+# This library is free software; you can redistribute it and/or
+# modify it under the terms of the GNU Lesser General Public
+# License as published by the Free Software Foundation; either
+# version 2.1 of the License, or (at your option) any later version.
+#
+# This library is distributed in the hope that it will be useful,
+# but WITHOUT ANY WARRANTY; without even the implied warranty of
+# MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the GNU
+# Lesser General Public License for more details.
+#
+# You should have received a copy of the GNU Lesser General Public
+# License along with this library; if not, write to the Free Software
+# Foundation, Inc., 51 Franklin Street, Fifth Floor, Boston, MA  02110-1301  USA
+
+clean_white_space = lambda do |entry|
+  entry.gsub(/(\A\n+|\n+\z)/, '') + "\n"
+end
+
+Gem::Specification.new do |spec|
+  spec.name = "chupa-text-decomposer-abiword"
+  spec.version = "1.0.0"
+  spec.homepage = "https://github.com/ranguba/chupa-text-decomposer-abiword"
+  spec.authors = ["Sutou Kouhei"]
+  spec.email = ["kou****@clear*****"]
+  readme = File.read("README.md", encoding: "UTF-8")
+  entries = readme.split(/^\#\#\s(.*)$/)
+  description = clean_white_space.call(entries[entries.index("Description") + 1])
+  spec.summary = description.split(/\n\n+/, 2).first
+  spec.description = description
+  spec.license = "LGPL-2.1+"
+  spec.files = ["#{spec.name}.gemspec"]
+  spec.files += ["README.md", "LICENSE.txt", "Rakefile", "Gemfile"]
+  spec.files += [".yardopts"]
+  spec.files += Dir.glob("lib/**/*.rb")
+  spec.files += Dir.glob("doc/text/*")
+  spec.files += Dir.glob("test/**/*")
+
+  spec.add_runtime_dependency("chupa-text")
+  spec.add_runtime_dependency("chupa-text-decomposer-pdf")
+
+  spec.add_development_dependency("bundler")
+  spec.add_development_dependency("rake")
+  spec.add_development_dependency("test-unit")
+  spec.add_development_dependency("packnga")
+  spec.add_development_dependency("kramdown")
+end

  Added: doc/text/news.md (+5 -0) 100644
===================================================================
--- /dev/null
+++ doc/text/news.md    2019-06-13 16:21:09 +0900 (3f1bede)
@@ -0,0 +1,5 @@
+# News
+
+## 1.0.0: 2019-06-13
+
+The first release!!!

  Added: lib/chupa-text/decomposers/abiword.rb (+132 -0) 100644
===================================================================
--- /dev/null
+++ lib/chupa-text/decomposers/abiword.rb    2019-06-13 16:21:09 +0900 (4a5e473)
@@ -0,0 +1,132 @@
+# Copyright (C) 2019  Sutou Kouhei <kou****@clear*****>
+#
+# This library is free software; you can redistribute it and/or
+# modify it under the terms of the GNU Lesser General Public
+# License as published by the Free Software Foundation; either
+# version 2.1 of the License, or (at your option) any later version.
+#
+# This library is distributed in the hope that it will be useful,
+# but WITHOUT ANY WARRANTY; without even the implied warranty of
+# MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the GNU
+# Lesser General Public License for more details.
+#
+# You should have received a copy of the GNU Lesser General Public
+# License along with this library; if not, write to the Free Software
+# Foundation, Inc., 51 Franklin Street, Fifth Floor, Boston, MA  02110-1301  USA
+
+require "tempfile"
+
+module ChupaText
+  module Decomposers
+    class AbiWord < Decomposer
+      include Loggable
+
+      registry.register("abiword", self)
+
+      EXTENSIONS = [
+        "abw",
+        "doc",
+        "docx",
+        "odt",
+        "rtf",
+        "zabw",
+      ]
+      MIME_TYPES = [
+        "application/msword",
+        "application/rtf",
+        "application/vnd.oasis.opendocument.text",
+        "application/vnd.openxmlformats-officedocument.wordprocessingml.document",
+        "application/x-abiword",
+      ]
+
+      def initialize(options)
+        super
+        @command = find_command
+        debug do
+          if @command
+            "#{log_tag}[command][found] #{@command.path}"
+          else
+            "#{log_tag}[command][not-found]"
+          end
+        end
+      end
+
+      def target?(data)
+        return false if****@comma*****?
+        EXTENSIONS.include?(data.extension) or
+          MIME_TYPES.include?(data.mime_type)
+      end
+
+      def decompose(data)
+        pdf_data = convert_to_pdf(data)
+        return if pdf_data.nil?
+        yield(pdf_data)
+      end
+
+      private
+      def find_command
+        candidates = [
+          @options[:abiword],
+          ENV["ABIWORD"],
+          "abiword",
+        ]
+        candidates.each do |candidate|
+          next if candidate.nil?
+          command = ExternalCommand.new(candidate)
+          return command if command.exist?
+        end
+        nil
+      end
+
+      def convert_to_pdf(data)
+        create_tempfiles(data) do |pdf, stdout, stderr|
+          succeeded =****@comma*****("--to", "pdf",
+                                   "--to-name", pdf.path,
+                                   data.path.to_s,
+                                   {
+                                     data: data,
+                                     spawn_options: {
+                                       out: stdout.path,
+                                       err: stderr.path,
+                                     },
+                                   })
+          unless succeeded
+            error do
+              tag = "#{log_tag}[convert][exited][abnormally]"
+              [
+                tag,
+                "output: <#{stdout.read}>",
+                "error: <#{stderr.read}>",
+              ].join("\n")
+            end
+            return nil
+          end
+          normalized_pdf_uri = data.uri.to_s.gsub(/\.[^.]+\z/, ".pdf")
+          File.open(pdf.path, "rb") do |pdf_input|
+            VirtualFileData.new(normalized_pdf_uri,
+                                pdf_input,
+                                source_data: data)
+          end
+        end
+      end
+
+      def create_tempfiles(data)
+        basename = File.basename(data.path)
+        pdf = Tempfile.new([basename, ".pdf"])
+        stdout = Tempfile.new([basename, ".stdout.log"])
+        stderr = Tempfile.new([basename, ".stderr.log"])
+        begin
+          yield(pdf, stdout, stderr)
+        ensure
+          pdf.close!
+          stdout.close!
+          stderr.close!
+        end
+      end
+
+      def log_tag
+        "[decomposer][abiword]"
+      end
+    end
+  end
+end

  Added: test/fixture/abw/multi-pages.abw (+43 -0) 100644
===================================================================
--- /dev/null
+++ test/fixture/abw/multi-pages.abw    2019-06-13 16:21:09 +0900 (bd6674b)
@@ -0,0 +1,43 @@
+<?xml version="1.0" encoding="UTF-8"?>
+<!DOCTYPE abiword PUBLIC "-//ABISOURCE//DTD AWML 1.0 Strict//EN" "http://www.abisource.com/awml.dtd">
+<abiword template="false" xmlns:ct="http://www.abisource.com/changetracking.dtd" xmlns:fo="http://www.w3.org/1999/XSL/Format" xmlns:math="http://www.w3.org/1998/Math/MathML" xid-max="4" xmlns:dc="http://purl.org/dc/elements/1.1/" fileformat="1.1" xmlns:svg="http://www.w3.org/2000/svg" xmlns:awml="http://www.abisource.com/awml.dtd" xmlns="http://www.abisource.com/awml.dtd" xmlns:xlink="http://www.w3.org/1999/xlink" version="3.0.2" xml:space="preserve" props="dom-dir:ltr; document-footnote-restart-section:0; document-endnote-type:numeric; document-endnote-place-enddoc:1; document-endnote-initial:1; lang:ja-JP; document-endnote-restart-section:0; document-footnote-restart-page:0; document-footnote-type:numeric; document-footnote-initial:1; document-endnote-place-endsection:0">
+<!-- ======================================================================== -->
+<!-- This file is an AbiWord document.                                        -->
+<!-- AbiWord is a free, Open Source word processor.                           -->
+<!-- More information about AbiWord is available at http://www.abisource.com/ -->
+<!-- You should not edit this file by hand.                                   -->
+<!-- ======================================================================== -->
+
+<metadata>
+<m key="abiword.date_last_changed">Thu Jun 13 16:15:17 2019
+</m>
+<m key="abiword.generator">AbiWord</m>
+<m key="dc.date">Thu Jun 13 16:15:17 2019
+</m>
+<m key="dc.format">application/x-abiword</m>
+<m key="meta:editing-cycles">1</m>
+<m key="meta:editing-duration">P0D</m>
+</metadata>
+<rdf>
+<t  s="styles.xml"  p="http://www.w3.org/1999/02/22-rdf-syntax-ns#type"  objecttype="1"  xsdtype=""  >http://docs.oasis-open.org/ns/office/1.2/meta/odf#StylesFile</t>
+<t  s="content.xml"  p="http://www.w3.org/1999/02/22-rdf-syntax-ns#type"  objecttype="1"  xsdtype=""  >http://docs.oasis-open.org/ns/office/1.2/meta/odf#ContentFile</t>
+<t  s="manifest.rdf"  p="http://docs.oasis-open.org/ns/office/1.2/meta/pkg#hasPart"  objecttype="1"  xsdtype=""  >styles.xml</t>
+<t  s="manifest.rdf"  p="http://docs.oasis-open.org/ns/office/1.2/meta/pkg#hasPart"  objecttype="1"  xsdtype=""  >content.xml</t>
+<t  s="manifest.rdf"  p="http://www.w3.org/1999/02/22-rdf-syntax-ns#type"  objecttype="1"  xsdtype=""  >http://docs.oasis-open.org/ns/office/1.2/meta/pkg#Document</t>
+</rdf>
+<history version="1" edit-time="16" last-saved="1560410117" uid="f5d2a67c-8daa-11e9-84b7-a514b42a119c">
+<version id="1" started="1560410117" uid="ff487a92-8daa-11e9-84b7-a514b42a119c" auto="0" top-xid="4"/>
+</history>
+<styles>
+<s type="P" name="Normal" props="lang:en-US; default-tab-interval:1.251cm; font-size:12pt; font-family:Liberation Serif; dom-dir:ltr"/>
+<s type="P" name="Caption" basedon="Normal" followedby="Caption" props="margin-top:0.212cm; font-size:12pt; margin-bottom:0.212cm; font-style:italic"/>
+<s type="P" name="Heading" basedon="Normal" followedby="Text body" props="margin-top:0.423cm; keep-with-next:yes; margin-bottom:0.212cm; font-family:Liberation Sans; font-size:14pt"/>
+<s type="P" name="Text body" basedon="Normal" followedby="Text body" props="margin-bottom:0.212cm; margin-top:0cm"/>
+</styles>
+<pagesize pagetype="A4" orientation="portrait" width="210.000000" height="297.000000" units="mm" page-scale="1.000000"/>
+<section xid="1" props="page-margin-right:2cm; page-width:21.001cm; page-margin-left:2cm; page-orientation:portrait; page-margin-bottom:2cm; page-margin-top:2cm; page-height:29.7cm">
+<p style="Normal" xid="2">Page1</p>
+<p xid="3"><pbr/></p>
+<p style="Normal" props="" xid="4">Page2</p>
+</section>
+</abiword>

  Added: test/fixture/abw/one-page.abw (+41 -0) 100644
===================================================================
--- /dev/null
+++ test/fixture/abw/one-page.abw    2019-06-13 16:21:09 +0900 (401718e)
@@ -0,0 +1,41 @@
+<?xml version="1.0" encoding="UTF-8"?>
+<!DOCTYPE abiword PUBLIC "-//ABISOURCE//DTD AWML 1.0 Strict//EN" "http://www.abisource.com/awml.dtd">
+<abiword template="false" xmlns:ct="http://www.abisource.com/changetracking.dtd" xmlns:fo="http://www.w3.org/1999/XSL/Format" xmlns:math="http://www.w3.org/1998/Math/MathML" xid-max="2" xmlns:dc="http://purl.org/dc/elements/1.1/" fileformat="1.1" xmlns:svg="http://www.w3.org/2000/svg" xmlns:awml="http://www.abisource.com/awml.dtd" xmlns="http://www.abisource.com/awml.dtd" xmlns:xlink="http://www.w3.org/1999/xlink" version="3.0.2" xml:space="preserve" props="dom-dir:ltr; document-footnote-restart-section:0; document-endnote-type:numeric; document-endnote-place-enddoc:1; document-endnote-initial:1; lang:ja-JP; document-endnote-restart-section:0; document-footnote-restart-page:0; document-footnote-type:numeric; document-footnote-initial:1; document-endnote-place-endsection:0">
+<!-- ======================================================================== -->
+<!-- This file is an AbiWord document.                                        -->
+<!-- AbiWord is a free, Open Source word processor.                           -->
+<!-- More information about AbiWord is available at http://www.abisource.com/ -->
+<!-- You should not edit this file by hand.                                   -->
+<!-- ======================================================================== -->
+
+<metadata>
+<m key="abiword.date_last_changed">Thu Jun 13 16:15:31 2019
+</m>
+<m key="abiword.generator">AbiWord</m>
+<m key="dc.date">Thu Jun 13 16:15:31 2019
+</m>
+<m key="dc.format">application/x-abiword</m>
+<m key="meta:editing-cycles">1</m>
+<m key="meta:editing-duration">P0D</m>
+</metadata>
+<rdf>
+<t  s="styles.xml"  p="http://www.w3.org/1999/02/22-rdf-syntax-ns#type"  objecttype="1"  xsdtype=""  >http://docs.oasis-open.org/ns/office/1.2/meta/odf#StylesFile</t>
+<t  s="content.xml"  p="http://www.w3.org/1999/02/22-rdf-syntax-ns#type"  objecttype="1"  xsdtype=""  >http://docs.oasis-open.org/ns/office/1.2/meta/odf#ContentFile</t>
+<t  s="manifest.rdf"  p="http://docs.oasis-open.org/ns/office/1.2/meta/pkg#hasPart"  objecttype="1"  xsdtype=""  >styles.xml</t>
+<t  s="manifest.rdf"  p="http://docs.oasis-open.org/ns/office/1.2/meta/pkg#hasPart"  objecttype="1"  xsdtype=""  >content.xml</t>
+<t  s="manifest.rdf"  p="http://www.w3.org/1999/02/22-rdf-syntax-ns#type"  objecttype="1"  xsdtype=""  >http://docs.oasis-open.org/ns/office/1.2/meta/pkg#Document</t>
+</rdf>
+<history version="1" edit-time="7" last-saved="1560410131" uid="03403cde-8dab-11e9-9847-bcd35e03657c">
+<version id="1" started="1560410131" uid="07b66e0a-8dab-11e9-9847-bcd35e03657c" auto="0" top-xid="2"/>
+</history>
+<styles>
+<s type="P" name="Normal" props="lang:en-US; default-tab-interval:1.251cm; font-size:12pt; font-family:Liberation Serif; dom-dir:ltr"/>
+<s type="P" name="Caption" basedon="Normal" followedby="Caption" props="margin-top:0.212cm; font-size:12pt; margin-bottom:0.212cm; font-style:italic"/>
+<s type="P" name="Heading" basedon="Normal" followedby="Text body" props="margin-top:0.423cm; keep-with-next:yes; margin-bottom:0.212cm; font-family:Liberation Sans; font-size:14pt"/>
+<s type="P" name="Text body" basedon="Normal" followedby="Text body" props="margin-bottom:0.212cm; margin-top:0cm"/>
+</styles>
+<pagesize pagetype="A4" orientation="portrait" width="210.000000" height="297.000000" units="mm" page-scale="1.000000"/>
+<section xid="1" props="page-margin-right:2cm; page-width:21.001cm; page-margin-left:2cm; page-orientation:portrait; page-margin-bottom:2cm; page-margin-top:2cm; page-height:29.7cm">
+<p style="Normal" xid="2">Page1</p>
+</section>
+</abiword>

  Added: test/fixture/doc/multi-pages.doc (+21 -0) 100644
===================================================================
--- /dev/null
+++ test/fixture/doc/multi-pages.doc    2019-06-13 16:21:09 +0900 (991047b)
@@ -0,0 +1,21 @@
+MIME-Version: 1.0
+mime-type: application/msword
+uri: file:/tmp/L8nSSf_multi-pages.doc
+path: /tmp/L8nSSf_multi-pages.doc
+size: 9216
+Content-Type: multipart/mixed; boundary=boundary
+
+--boundary
+mime-type: text/plain
+uri: file:/tmp/L8nSSf_multi-pages.txt
+path: /tmp/L8nSSf_multi-pages.txt
+size: 12
+created_time: 2019-06-13 07:21:25 UTC
+source-mime-types: ["application/pdf", "application/msword"]
+creator: Writer
+producer: LibreOffice 5.2
+
+Page1
+Page2
+
+--boundary--

  Added: test/fixture/doc/one-page.doc (+20 -0) 100644
===================================================================
--- /dev/null
+++ test/fixture/doc/one-page.doc    2019-06-13 16:21:09 +0900 (4216000)
@@ -0,0 +1,20 @@
+MIME-Version: 1.0
+mime-type: application/msword
+uri: file:/tmp/wLd11d_one-page.doc
+path: /tmp/wLd11d_one-page.doc
+size: 9216
+Content-Type: multipart/mixed; boundary=boundary
+
+--boundary
+mime-type: text/plain
+uri: file:/tmp/wLd11d_one-page.txt
+path: /tmp/wLd11d_one-page.txt
+size: 6
+created_time: 2019-06-13 07:21:27 UTC
+source-mime-types: ["application/pdf", "application/msword"]
+creator: Writer
+producer: LibreOffice 5.2
+
+Page1
+
+--boundary--

  Added: test/fixture/docx/multi-pages.docx (+21 -0) 100644
===================================================================
--- /dev/null
+++ test/fixture/docx/multi-pages.docx    2019-06-13 16:21:09 +0900 (14b17ff)
@@ -0,0 +1,21 @@
+MIME-Version: 1.0
+mime-type: application/vnd.openxmlformats-officedocument.wordprocessingml.document
+uri: file:/tmp/qL9jTF_multi-pages.docx
+path: /tmp/qL9jTF_multi-pages.docx
+size: 3889
+Content-Type: multipart/mixed; boundary=boundary
+
+--boundary
+mime-type: text/plain
+uri: file:/tmp/qL9jTF_multi-pages.txt
+path: /tmp/qL9jTF_multi-pages.txt
+size: 12
+created_time: 2014-01-05 15:35:56 UTC
+modified_time: 2014-01-05 15:36:34 UTC
+source-mime-types: ["application/vnd.openxmlformats-officedocument.wordprocessingml.document"]
+application: LibreOffice/4.1.5.3$Linux_X86_64 LibreOffice_project/410m0$Build-3
+
+Page1
+Page2
+
+--boundary--

  Added: test/fixture/docx/one-page.docx (+20 -0) 100644
===================================================================
--- /dev/null
+++ test/fixture/docx/one-page.docx    2019-06-13 16:21:09 +0900 (0b90a5e)
@@ -0,0 +1,20 @@
+MIME-Version: 1.0
+mime-type: application/vnd.openxmlformats-officedocument.wordprocessingml.document
+uri: file:/tmp/tJE5NG_one-page.docx
+path: /tmp/tJE5NG_one-page.docx
+size: 3871
+Content-Type: multipart/mixed; boundary=boundary
+
+--boundary
+mime-type: text/plain
+uri: file:/tmp/tJE5NG_one-page.txt
+path: /tmp/tJE5NG_one-page.txt
+size: 6
+created_time: 2014-01-05 15:34:49 UTC
+modified_time: 2014-01-05 15:35:24 UTC
+source-mime-types: ["application/vnd.openxmlformats-officedocument.wordprocessingml.document"]
+application: LibreOffice/4.1.5.3$Linux_X86_64 LibreOffice_project/410m0$Build-3
+
+Page1
+
+--boundary--

  Added: test/fixture/odt/multi-pages.odt (+21 -0) 100644
===================================================================
--- /dev/null
+++ test/fixture/odt/multi-pages.odt    2019-06-13 16:21:09 +0900 (c21c3ca)
@@ -0,0 +1,21 @@
+MIME-Version: 1.0
+mime-type: application/vnd.oasis.opendocument.text
+uri: file:/tmp/A3wPvH_multi-pages.odt
+path: /tmp/A3wPvH_multi-pages.odt
+size: 7874
+Content-Type: multipart/mixed; boundary=boundary
+
+--boundary
+mime-type: text/plain
+uri: file:/tmp/A3wPvH_multi-pages.txt
+path: /tmp/A3wPvH_multi-pages.txt
+size: 12
+created_time: 2014-01-05 15:35:56 +0900
+modified_time: 2014-01-05 15:36:34 +0900
+source-mime-types: ["application/vnd.oasis.opendocument.text"]
+generator: LibreOffice/4.1.4.2$Linux_X86_64 LibreOffice_project/410m0$Build-2
+
+Page1
+Page2
+
+--boundary--

  Added: test/fixture/odt/one-page.odt (+20 -0) 100644
===================================================================
--- /dev/null
+++ test/fixture/odt/one-page.odt    2019-06-13 16:21:09 +0900 (a77f44e)
@@ -0,0 +1,20 @@
+MIME-Version: 1.0
+mime-type: application/vnd.oasis.opendocument.text
+uri: file:/tmp/5tbJRE_one-page.odt
+path: /tmp/5tbJRE_one-page.odt
+size: 7662
+Content-Type: multipart/mixed; boundary=boundary
+
+--boundary
+mime-type: text/plain
+uri: file:/tmp/5tbJRE_one-page.txt
+path: /tmp/5tbJRE_one-page.txt
+size: 6
+created_time: 2014-01-05 15:34:49 +0900
+modified_time: 2014-01-05 15:35:24 +0900
+source-mime-types: ["application/vnd.oasis.opendocument.text"]
+generator: LibreOffice/4.1.4.2$Linux_X86_64 LibreOffice_project/410m0$Build-2
+
+Page1
+
+--boundary--

  Added: test/fixture/rtf/multi-pages.rtf (+19 -0) 100644
===================================================================
--- /dev/null
+++ test/fixture/rtf/multi-pages.rtf    2019-06-13 16:21:09 +0900 (1427a05)
@@ -0,0 +1,19 @@
+{\rtf1\ansi\deff3\adeflang1025
+{\fonttbl{\f0\froman\fprq2\fcharset0 Times New Roman;}{\f1\froman\fprq2\fcharset2 Symbol;}{\f2\fswiss\fprq2\fcharset0 Arial;}{\f3\froman\fprq2\fcharset0 Liberation Serif{\*\falt Times New Roman};}{\f4\fswiss\fprq2\fcharset0 Liberation Sans{\*\falt Arial};}{\f5\fnil\fprq2\fcharset128 \'83\'82\'83\'67\'83\'84L\'83\'7d\'83\'8b\'83\'78\'83\'8a3\'93\'99\'95\'9d;}{\f6\fnil\fprq2\fcharset0 Lohit Devanagari;}{\f7\fnil\fprq0\fcharset0 Lohit Devanagari;}}
+{\colortbl;\red0\green0\blue0;\red0\green0\blue255;\red0\green255\blue255;\red0\green255\blue0;\red255\green0\blue255;\red255\green0\blue0;\red255\green255\blue0;\red255\green255\blue255;\red0\green0\blue128;\red0\green128\blue128;\red0\green128\blue0;\red128\green0\blue128;\red128\green0\blue0;\red128\green128\blue0;\red128\green128\blue128;\red192\green192\blue192;}
+{\stylesheet{\s0\snext0\nowidctlpar\hyphpar0\aspalpha\ltrpar\cf0\kerning1\dbch\af5\langfe1041\dbch\af6\afs24\alang1081\loch\f3\hich\af3\fs24\lang1033 Normal;}
+{\s15\sbasedon0\snext16\sb240\sa120\keepn\dbch\af5\dbch\af6\afs28\loch\f4\fs28 \u35211\'3f\u20986\'3f\u12375\'3f;}
+{\s16\sbasedon0\snext16\sb0\sa120 \u26412\'3f\u25991\'3f;}
+{\s17\sbasedon16\snext17\sb0\sa120\dbch\af7 \u12522\'3f\u12473\'3f\u12488\'3f;}
+{\s18\sbasedon0\snext18\sb120\sa120\noline\i\dbch\af7\afs24\ai\fs24 \u12461\'3f\u12515\'3f\u12503\'3f\u12471\'3f\u12519\'3f\u12531\'3f;}
+{\s19\sbasedon0\snext19\noline\dbch\af7 \u32034\'3f\u24341\'3f;}
+}{\*\generator LibreOffice/6.1.5.2$Linux_X86_64 LibreOffice_project/10$Build-2}{\info{\creatim\yr2014\mo1\dy5\hr15\min35}{\revtim\yr2014\mo1\dy5\hr15\min36}{\printim\yr0\mo0\dy0\hr0\min0}}{\*\userprops}\deftab709
+\viewscale110
+{\*\pgdsctbl
+{\pgdsc0\pgdscuse451\pgwsxn11906\pghsxn16838\marglsxn1134\margrsxn1134\margtsxn1134\margbsxn1134\pgdscnxt0 \u27161\'3f\u28310\'3f\u12473\'3f\u12479\'3f\u12452\'3f\u12523\'3f;}}
+\formshade\paperh16838\paperw11906\margl1134\margr1134\margt1134\margb1134\sectd\sbknone\sectunlocked1\pgndec\pgwsxn11906\pghsxn16838\marglsxn1134\margrsxn1134\margtsxn1134\margbsxn1134\ftnbj\ftnstart1\ftnrstcont\ftnnar\aenddoc\aftnrstcont\aftnstart1\aftnnrlc
+{\*\ftnsep\chftnsep}\pgndec\pard\plain \s0\nowidctlpar\hyphpar0\aspalpha\ltrpar\cf0\kerning1\dbch\af5\langfe1041\dbch\af6\afs24\alang1081\loch\f3\hich\af3\fs24\lang1033{\rtlch \ltrch\loch
+Page1}
+\par \pard\plain \s0\nowidctlpar\hyphpar0\aspalpha\ltrpar\cf0\kerning1\dbch\af5\langfe1041\dbch\af6\afs24\alang1081\loch\f3\hich\af3\fs24\lang1033\pagebb{\rtlch \ltrch\loch
+Page2}
+\par }
\ No newline at end of file

  Added: test/fixture/rtf/one-page.rtf (+17 -0) 100644
===================================================================
--- /dev/null
+++ test/fixture/rtf/one-page.rtf    2019-06-13 16:21:09 +0900 (3a00494)
@@ -0,0 +1,17 @@
+{\rtf1\ansi\deff3\adeflang1025
+{\fonttbl{\f0\froman\fprq2\fcharset0 Times New Roman;}{\f1\froman\fprq2\fcharset2 Symbol;}{\f2\fswiss\fprq2\fcharset0 Arial;}{\f3\froman\fprq2\fcharset0 Liberation Serif{\*\falt Times New Roman};}{\f4\fswiss\fprq2\fcharset0 Liberation Sans{\*\falt Arial};}{\f5\fnil\fprq2\fcharset128 \'83\'82\'83\'67\'83\'84L\'83\'7d\'83\'8b\'83\'78\'83\'8a3\'93\'99\'95\'9d;}{\f6\fnil\fprq2\fcharset0 Lohit Devanagari;}{\f7\fnil\fprq0\fcharset0 Lohit Devanagari;}}
+{\colortbl;\red0\green0\blue0;\red0\green0\blue255;\red0\green255\blue255;\red0\green255\blue0;\red255\green0\blue255;\red255\green0\blue0;\red255\green255\blue0;\red255\green255\blue255;\red0\green0\blue128;\red0\green128\blue128;\red0\green128\blue0;\red128\green0\blue128;\red128\green0\blue0;\red128\green128\blue0;\red128\green128\blue128;\red192\green192\blue192;}
+{\stylesheet{\s0\snext0\nowidctlpar\hyphpar0\aspalpha\ltrpar\cf0\kerning1\dbch\af5\langfe1041\dbch\af6\afs24\alang1081\loch\f3\hich\af3\fs24\lang1033 Normal;}
+{\s15\sbasedon0\snext16\sb240\sa120\keepn\dbch\af5\dbch\af6\afs28\loch\f4\fs28 \u35211\'3f\u20986\'3f\u12375\'3f;}
+{\s16\sbasedon0\snext16\sb0\sa120 \u26412\'3f\u25991\'3f;}
+{\s17\sbasedon16\snext17\sb0\sa120\dbch\af7 \u12522\'3f\u12473\'3f\u12488\'3f;}
+{\s18\sbasedon0\snext18\sb120\sa120\noline\i\dbch\af7\afs24\ai\fs24 \u12461\'3f\u12515\'3f\u12503\'3f\u12471\'3f\u12519\'3f\u12531\'3f;}
+{\s19\sbasedon0\snext19\noline\dbch\af7 \u32034\'3f\u24341\'3f;}
+}{\*\generator LibreOffice/6.1.5.2$Linux_X86_64 LibreOffice_project/10$Build-2}{\info{\creatim\yr2014\mo1\dy5\hr15\min34}{\revtim\yr2014\mo1\dy5\hr15\min35}{\printim\yr0\mo0\dy0\hr0\min0}}{\*\userprops}\deftab709
+\viewscale110
+{\*\pgdsctbl
+{\pgdsc0\pgdscuse451\pgwsxn11906\pghsxn16838\marglsxn1134\margrsxn1134\margtsxn1134\margbsxn1134\pgdscnxt0 \u27161\'3f\u28310\'3f\u12473\'3f\u12479\'3f\u12452\'3f\u12523\'3f;}}
+\formshade\paperh16838\paperw11906\margl1134\margr1134\margt1134\margb1134\sectd\sbknone\sectunlocked1\pgndec\pgwsxn11906\pghsxn16838\marglsxn1134\margrsxn1134\margtsxn1134\margbsxn1134\ftnbj\ftnstart1\ftnrstcont\ftnnar\aenddoc\aftnrstcont\aftnstart1\aftnnrlc
+{\*\ftnsep\chftnsep}\pgndec\pard\plain \s0\nowidctlpar\hyphpar0\aspalpha\ltrpar\cf0\kerning1\dbch\af5\langfe1041\dbch\af6\afs24\alang1081\loch\f3\hich\af3\fs24\lang1033{\rtlch \ltrch\loch
+Page1}
+\par }
\ No newline at end of file

  Added: test/fixture/zabw/multi-pages.zabw (+0 -0) 100644
===================================================================
(Binary files differ)

  Added: test/fixture/zabw/one-page.zabw (+0 -0) 100644
===================================================================
(Binary files differ)

  Added: test/helper.rb (+57 -0) 100644
===================================================================
--- /dev/null
+++ test/helper.rb    2019-06-13 16:21:09 +0900 (33955f7)
@@ -0,0 +1,57 @@
+# Copyright (C) 2019  Kouhei Sutou <kou****@clear*****>
+#
+# This library is free software; you can redistribute it and/or
+# modify it under the terms of the GNU Lesser General Public
+# License as published by the Free Software Foundation; either
+# version 2.1 of the License, or (at your option) any later version.
+#
+# This library is distributed in the hope that it will be useful,
+# but WITHOUT ANY WARRANTY; without even the implied warranty of
+# MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the GNU
+# Lesser General Public License for more details.
+#
+# You should have received a copy of the GNU Lesser General Public
+# License along with this library; if not, write to the Free Software
+# Foundation, Inc., 51 Franklin Street, Fifth Floor, Boston, MA  02110-1301  USA
+
+require "pathname"
+
+module FixtureHelper
+  def fixture_path(*components)
+    base_path = Pathname(__dir__) + "fixture"
+    base_path.join(*components)
+  end
+end
+
+module DecomposeHelper
+  def decompose(path)
+    data = ChupaText::InputData.new(path)
+
+    pdf_decomposer = ChupaText::Decomposers::PDF.new({})
+    decomposed = []
+    @decomposer.decompose(data) do |decomposed_data|
+      if pdf_decomposer.target?(decomposed_data)
+        pdf_decomposer.decompose(decomposed_data) do |pdf_decomposed_data|
+          decomposed << pdf_decomposed_data
+        end
+      else
+        decomposed << decomposed_data
+      end
+    end
+    decomposed
+  end
+
+  def normalize_producers(producers)
+    producers.collect do |producer|
+      normalize_producer(producer)
+    end
+  end
+
+  def normalize_producer(producer)
+    if /\Acairo \d+\.\d+\.\d+ \(https:\/\/cairographics\.org\)\z/ =~ producer
+      "cairo"
+    else
+      producer
+    end
+  end
+end

  Added: test/run-test.rb (+31 -0) 100755
===================================================================
--- /dev/null
+++ test/run-test.rb    2019-06-13 16:21:09 +0900 (42ef09a)
@@ -0,0 +1,31 @@
+#!/usr/bin/env ruby
+#
+# Copyright (C) 2019  Sutou Kouhei <kou****@clear*****>
+#
+# This library is free software; you can redistribute it and/or
+# modify it under the terms of the GNU Lesser General Public
+# License as published by the Free Software Foundation; either
+# version 2.1 of the License, or (at your option) any later version.
+#
+# This library is distributed in the hope that it will be useful,
+# but WITHOUT ANY WARRANTY; without even the implied warranty of
+# MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the GNU
+# Lesser General Public License for more details.
+#
+# You should have received a copy of the GNU Lesser General Public
+# License along with this library; if not, write to the Free Software
+# Foundation, Inc., 51 Franklin Street, Fifth Floor, Boston, MA  02110-1301  USA
+
+$VERBOSE = true
+
+require "bundler/setup"
+
+require "test-unit"
+
+require "chupa-text"
+
+ChupaText::Decomposers.load
+
+require_relative "helper"
+
+exit(Test::Unit::AutoRunner.run(true))

  Added: test/test-abw.rb (+84 -0) 100644
===================================================================
--- /dev/null
+++ test/test-abw.rb    2019-06-13 16:21:09 +0900 (ace5ca1)
@@ -0,0 +1,84 @@
+# Copyright (C) 2019  Sutou Kouhei <kou****@clear*****>
+#
+# This library is free software; you can redistribute it and/or
+# modify it under the terms of the GNU Lesser General Public
+# License as published by the Free Software Foundation; either
+# version 2.1 of the License, or (at your option) any later version.
+#
+# This library is distributed in the hope that it will be useful,
+# but WITHOUT ANY WARRANTY; without even the implied warranty of
+# MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the GNU
+# Lesser General Public License for more details.
+#
+# You should have received a copy of the GNU Lesser General Public
+# License along with this library; if not, write to the Free Software
+# Foundation, Inc., 51 Franklin Street, Fifth Floor, Boston, MA  02110-1301  USA
+
+class TestAbw < Test::Unit::TestCase
+  include FixtureHelper
+
+  def setup
+    @decomposer = ChupaText::Decomposers::AbiWord.new({})
+  end
+
+  def fixture_path(*components)
+    super("abw", *components)
+  end
+
+  sub_test_case("target?") do
+    sub_test_case("extension") do
+      def create_data(uri)
+        data = ChupaText::Data.new
+        data.body = ""
+        data.uri = uri
+        data
+      end
+
+      def test_doc
+        assert_true(@decomposer.target?(create_data("document.abw")))
+      end
+    end
+
+    sub_test_case("mime-type") do
+      def create_data(mime_type)
+        data = ChupaText::Data.new
+        data.mime_type = mime_type
+        data
+      end
+
+      def test_abiword
+        mime_type = "application/x-abiword"
+        assert_true(@decomposer.target?(create_data(mime_type)))
+      end
+    end
+  end
+
+  sub_test_case("decompose") do
+    include DecomposeHelper
+
+    sub_test_case("one page") do
+      def test_body
+        assert_equal(["Page1\n"], decompose.collect(&:body))
+      end
+
+      private
+      def decompose
+        super(fixture_path("one-page.abw"))
+      end
+    end
+
+    sub_test_case("multi pages") do
+      def test_body
+        assert_equal([<<-BODY], decompose.collect(&:body))
+Page1
+Page2
+        BODY
+      end
+
+      private
+      def decompose
+        super(fixture_path("multi-pages.abw"))
+      end
+    end
+  end
+end

  Added: test/test-doc.rb (+84 -0) 100644
===================================================================
--- /dev/null
+++ test/test-doc.rb    2019-06-13 16:21:09 +0900 (24d8407)
@@ -0,0 +1,84 @@
+# Copyright (C) 2019  Sutou Kouhei <kou****@clear*****>
+#
+# This library is free software; you can redistribute it and/or
+# modify it under the terms of the GNU Lesser General Public
+# License as published by the Free Software Foundation; either
+# version 2.1 of the License, or (at your option) any later version.
+#
+# This library is distributed in the hope that it will be useful,
+# but WITHOUT ANY WARRANTY; without even the implied warranty of
+# MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the GNU
+# Lesser General Public License for more details.
+#
+# You should have received a copy of the GNU Lesser General Public
+# License along with this library; if not, write to the Free Software
+# Foundation, Inc., 51 Franklin Street, Fifth Floor, Boston, MA  02110-1301  USA
+
+class TestDoc < Test::Unit::TestCase
+  include FixtureHelper
+
+  def setup
+    @decomposer = ChupaText::Decomposers::AbiWord.new({})
+  end
+
+  def fixture_path(*components)
+    super("doc", *components)
+  end
+
+  sub_test_case("target?") do
+    sub_test_case("extension") do
+      def create_data(uri)
+        data = ChupaText::Data.new
+        data.body = ""
+        data.uri = uri
+        data
+      end
+
+      def test_doc
+        assert_true(@decomposer.target?(create_data("document.doc")))
+      end
+    end
+
+    sub_test_case("mime-type") do
+      def create_data(mime_type)
+        data = ChupaText::Data.new
+        data.mime_type = mime_type
+        data
+      end
+
+      def test_ms_word
+        mime_type = "application/msword"
+        assert_true(@decomposer.target?(create_data(mime_type)))
+      end
+    end
+  end
+
+  sub_test_case("decompose") do
+    include DecomposeHelper
+
+    sub_test_case("one page") do
+      def test_body
+        assert_equal(["Page1\n"], decompose.collect(&:body))
+      end
+
+      private
+      def decompose
+        super(fixture_path("one-page.doc"))
+      end
+    end
+
+    sub_test_case("multi pages") do
+      def test_body
+        assert_equal([<<-BODY], decompose.collect(&:body))
+Page1
+Page2
+        BODY
+      end
+
+      private
+      def decompose
+        super(fixture_path("multi-pages.doc"))
+      end
+    end
+  end
+end

  Added: test/test-docx.rb (+84 -0) 100644
===================================================================
--- /dev/null
+++ test/test-docx.rb    2019-06-13 16:21:09 +0900 (a4f2196)
@@ -0,0 +1,84 @@
+# Copyright (C) 2019  Sutou Kouhei <kou****@clear*****>
+#
+# This library is free software; you can redistribute it and/or
+# modify it under the terms of the GNU Lesser General Public
+# License as published by the Free Software Foundation; either
+# version 2.1 of the License, or (at your option) any later version.
+#
+# This library is distributed in the hope that it will be useful,
+# but WITHOUT ANY WARRANTY; without even the implied warranty of
+# MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the GNU
+# Lesser General Public License for more details.
+#
+# You should have received a copy of the GNU Lesser General Public
+# License along with this library; if not, write to the Free Software
+# Foundation, Inc., 51 Franklin Street, Fifth Floor, Boston, MA  02110-1301  USA
+
+class TestDocx < Test::Unit::TestCase
+  include FixtureHelper
+
+  def setup
+    @decomposer = ChupaText::Decomposers::AbiWord.new({})
+  end
+
+  def fixture_path(*components)
+    super("docx", *components)
+  end
+
+  sub_test_case("target?") do
+    sub_test_case("extension") do
+      def create_data(uri)
+        data = ChupaText::Data.new
+        data.body = ""
+        data.uri = uri
+        data
+      end
+
+      def test_doc
+        assert_true(@decomposer.target?(create_data("document.docx")))
+      end
+    end
+
+    sub_test_case("mime-type") do
+      def create_data(mime_type)
+        data = ChupaText::Data.new
+        data.mime_type = mime_type
+        data
+      end
+
+      def test_openxml_document
+        mime_type = "application/vnd.openxmlformats-officedocument.wordprocessingml.document"
+        assert_true(@decomposer.target?(create_data(mime_type)))
+      end
+    end
+  end
+
+  sub_test_case("decompose") do
+    include DecomposeHelper
+
+    sub_test_case("one page") do
+      def test_body
+        assert_equal(["Page1\n"], decompose.collect(&:body))
+      end
+
+      private
+      def decompose
+        super(fixture_path("one-page.docx"))
+      end
+    end
+
+    sub_test_case("multi pages") do
+      def test_body
+        assert_equal([<<-BODY], decompose.collect(&:body))
+Page1
+Page2
+        BODY
+      end
+
+      private
+      def decompose
+        super(fixture_path("multi-pages.docx"))
+      end
+    end
+  end
+end

  Added: test/test-odt.rb (+84 -0) 100644
===================================================================
--- /dev/null
+++ test/test-odt.rb    2019-06-13 16:21:09 +0900 (515a9a6)
@@ -0,0 +1,84 @@
+# Copyright (C) 2019  Sutou Kouhei <kou****@clear*****>
+#
+# This library is free software; you can redistribute it and/or
+# modify it under the terms of the GNU Lesser General Public
+# License as published by the Free Software Foundation; either
+# version 2.1 of the License, or (at your option) any later version.
+#
+# This library is distributed in the hope that it will be useful,
+# but WITHOUT ANY WARRANTY; without even the implied warranty of
+# MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the GNU
+# Lesser General Public License for more details.
+#
+# You should have received a copy of the GNU Lesser General Public
+# License along with this library; if not, write to the Free Software
+# Foundation, Inc., 51 Franklin Street, Fifth Floor, Boston, MA  02110-1301  USA
+
+class TestOdt < Test::Unit::TestCase
+  include FixtureHelper
+
+  def setup
+    @decomposer = ChupaText::Decomposers::AbiWord.new({})
+  end
+
+  def fixture_path(*components)
+    super("odt", *components)
+  end
+
+  sub_test_case("target?") do
+    sub_test_case("extension") do
+      def create_data(uri)
+        data = ChupaText::Data.new
+        data.body = ""
+        data.uri = uri
+        data
+      end
+
+      def test_doc
+        assert_true(@decomposer.target?(create_data("document.odt")))
+      end
+    end
+
+    sub_test_case("mime-type") do
+      def create_data(mime_type)
+        data = ChupaText::Data.new
+        data.mime_type = mime_type
+        data
+      end
+
+      def test_opendocument_text
+        mime_type = "application/vnd.oasis.opendocument.text"
+        assert_true(@decomposer.target?(create_data(mime_type)))
+      end
+    end
+  end
+
+  sub_test_case("decompose") do
+    include DecomposeHelper
+
+    sub_test_case("one page") do
+      def test_body
+        assert_equal(["Page1\n"], decompose.collect(&:body))
+      end
+
+      private
+      def decompose
+        super(fixture_path("one-page.odt"))
+      end
+    end
+
+    sub_test_case("multi pages") do
+      def test_body
+        assert_equal([<<-BODY], decompose.collect(&:body))
+Page1
+Page2
+        BODY
+      end
+
+      private
+      def decompose
+        super(fixture_path("multi-pages.odt"))
+      end
+    end
+  end
+end

  Added: test/test-rtf.rb (+84 -0) 100644
===================================================================
--- /dev/null
+++ test/test-rtf.rb    2019-06-13 16:21:09 +0900 (8d8793f)
@@ -0,0 +1,84 @@
+# Copyright (C) 2019  Sutou Kouhei <kou****@clear*****>
+#
+# This library is free software; you can redistribute it and/or
+# modify it under the terms of the GNU Lesser General Public
+# License as published by the Free Software Foundation; either
+# version 2.1 of the License, or (at your option) any later version.
+#
+# This library is distributed in the hope that it will be useful,
+# but WITHOUT ANY WARRANTY; without even the implied warranty of
+# MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the GNU
+# Lesser General Public License for more details.
+#
+# You should have received a copy of the GNU Lesser General Public
+# License along with this library; if not, write to the Free Software
+# Foundation, Inc., 51 Franklin Street, Fifth Floor, Boston, MA  02110-1301  USA
+
+class TestRtf < Test::Unit::TestCase
+  include FixtureHelper
+
+  def setup
+    @decomposer = ChupaText::Decomposers::AbiWord.new({})
+  end
+
+  def fixture_path(*components)
+    super("rtf", *components)
+  end
+
+  sub_test_case("target?") do
+    sub_test_case("extension") do
+      def create_data(uri)
+        data = ChupaText::Data.new
+        data.body = ""
+        data.uri = uri
+        data
+      end
+
+      def test_doc
+        assert_true(@decomposer.target?(create_data("document.rtf")))
+      end
+    end
+
+    sub_test_case("mime-type") do
+      def create_data(mime_type)
+        data = ChupaText::Data.new
+        data.mime_type = mime_type
+        data
+      end
+
+      def test_rich_text_format
+        mime_type = "application/rtf"
+        assert_true(@decomposer.target?(create_data(mime_type)))
+      end
+    end
+  end
+
+  sub_test_case("decompose") do
+    include DecomposeHelper
+
+    sub_test_case("one page") do
+      def test_body
+        assert_equal(["Page1\n"], decompose.collect(&:body))
+      end
+
+      private
+      def decompose
+        super(fixture_path("one-page.rtf"))
+      end
+    end
+
+    sub_test_case("multi pages") do
+      def test_body
+        assert_equal([<<-BODY], decompose.collect(&:body))
+Page1
+Page2
+        BODY
+      end
+
+      private
+      def decompose
+        super(fixture_path("multi-pages.rtf"))
+      end
+    end
+  end
+end

  Added: test/test-zabw.rb (+71 -0) 100644
===================================================================
--- /dev/null
+++ test/test-zabw.rb    2019-06-13 16:21:09 +0900 (193866c)
@@ -0,0 +1,71 @@
+# Copyright (C) 2019  Sutou Kouhei <kou****@clear*****>
+#
+# This library is free software; you can redistribute it and/or
+# modify it under the terms of the GNU Lesser General Public
+# License as published by the Free Software Foundation; either
+# version 2.1 of the License, or (at your option) any later version.
+#
+# This library is distributed in the hope that it will be useful,
+# but WITHOUT ANY WARRANTY; without even the implied warranty of
+# MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the GNU
+# Lesser General Public License for more details.
+#
+# You should have received a copy of the GNU Lesser General Public
+# License along with this library; if not, write to the Free Software
+# Foundation, Inc., 51 Franklin Street, Fifth Floor, Boston, MA  02110-1301  USA
+
+class TestZabw < Test::Unit::TestCase
+  include FixtureHelper
+
+  def setup
+    @decomposer = ChupaText::Decomposers::AbiWord.new({})
+  end
+
+  def fixture_path(*components)
+    super("zabw", *components)
+  end
+
+  sub_test_case("target?") do
+    sub_test_case("extension") do
+      def create_data(uri)
+        data = ChupaText::Data.new
+        data.body = ""
+        data.uri = uri
+        data
+      end
+
+      def test_doc
+        assert_true(@decomposer.target?(create_data("document.zabw")))
+      end
+    end
+  end
+
+  sub_test_case("decompose") do
+    include DecomposeHelper
+
+    sub_test_case("one page") do
+      def test_body
+        assert_equal(["Page1\n"], decompose.collect(&:body))
+      end
+
+      private
+      def decompose
+        super(fixture_path("one-page.zabw"))
+      end
+    end
+
+    sub_test_case("multi pages") do
+      def test_body
+        assert_equal([<<-BODY], decompose.collect(&:body))
+Page1
+Page2
+        BODY
+      end
+
+      private
+      def decompose
+        super(fixture_path("multi-pages.zabw"))
+      end
+    end
+  end
+end


More information about the Groonga-commit mailing list
Back to archive index