
Text Processing
1877 projects in result set
LastUpdate: 2004-08-02 18:37

html2db

html2db.xsl converts an XHTML source document into a DocBook output document. It provides features for customizing the generation of the output, so that the output can be tuned by annotating the source rather than by hand-editing the output. This makes it useful in a processing pipeline where the source documents are maintained in HTML, although it can also be used as a one-time conversion tool.

LastUpdate: 2003-09-22 16:27

TDHkit

TDHkit is a set of programs and filters which are
useful in working with whitespace-delimited ASCII
data from the command line or in shell scripts. It
was developed to supplement standard Unix
utilities such as sort and uniq, for purposes such
as selecting records, selecting fields, relational
joins, reformatting dates, etc. Fields can be
manipulated by name if data files have a field
name header.
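
As an illustration of manipulating fields by name, here is a minimal Python sketch that selects named columns from whitespace-delimited records with a field-name header. It is not TDHkit code; the function name `select_fields` and the sample data are hypothetical.

```python
def select_fields(lines, names):
    """Yield the requested columns from whitespace-delimited records.

    The first non-empty line is treated as the field-name header.
    (Hypothetical helper for illustration; not part of TDHkit.)
    """
    rows = (line.split() for line in lines if line.strip())
    header = next(rows)
    # Map each requested field name to its column position.
    positions = [header.index(name) for name in names]
    for row in rows:
        yield [row[i] for i in positions]


# Made-up sample data with a field-name header.
data = """\
id name qty
1 apple 3
2 pear 7
"""

selected = list(select_fields(data.splitlines(), ["name", "qty"]))
print(selected)
```

Looking up positions once from the header keeps the per-record work to simple indexing.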

LastUpdate: 2004-06-15 20:21

XML Binary Infoset Encoding

XML Binary Infoset Encoding (XBIS) is an encoding designed to eliminate most of the overhead of XML text documents being passed between programs, while being faster to generate and interpret. The focus is more on speed than on size, so if document size is the major concern, standard compression algorithms can offer superior results. The current Java implementation shows 4-8X performance benefits over standard XML parsers across a range of document types and sizes and across the JVMs tested.

LastUpdate: 2003-09-13 11:12

Colorer Library

Colorer Library provides source text syntax highlighting and text
parsing services for host applications. It colorizes source code on
host editor systems in more than 100 formats. It uses the powerful HRC
format (XML, regexp, context-free grammars), allowing it to support
any language. The parser can search and build lists of special text
tokens (function lists, syntax errors) and search and indent
programming language constructs (brackets, paired tags).

LastUpdate: 2008-03-22 22:34

xmltoman

xmltoman and xmlmantohtml are two small scripts to
convert XML documents to man pages in groff format
or HTML. They support the usual man page items such
as "description", "options", "see also", etc.

LastUpdate: 2001-01-30 06:13

LineFeed

LineFeed is a GTK graphical utility that offers an easy way to convert DOS text files to UNIX text files by removing all unwanted carriage return characters.
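
The conversion itself can be sketched in a few lines of Python. This is an illustration of the general technique, not LineFeed's own code; the function name `dos_to_unix` is hypothetical, and the sketch assumes that only the carriage returns in CRLF pairs are unwanted, so a lone carriage return is left untouched.

```python
def dos_to_unix(data: bytes) -> bytes:
    """Convert DOS line endings (CRLF) to UNIX line endings (LF).

    Assumption: only the carriage return of a CRLF pair is removed.
    (Hypothetical helper for illustration; not part of LineFeed.)
    """
    return data.replace(b"\r\n", b"\n")


sample = b"first line\r\nsecond line\r\n"
converted = dos_to_unix(sample)
print(converted)
```

Working on bytes rather than decoded text avoids any dependence on the file's character encoding.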

LastUpdate: 2007-06-05 15:12

Puno

Puno is a PHP module (PHP5 and Linux/Unix only)
that brings the OpenOffice.org UNO Programming
API to the PHP userspace. You can use it to write
scripts that create, modify, read, and save
OpenOffice.org documents (Writer, Spreadsheet,
and Drawing). You can export these documents in
various formats, such as PDF or HTML.

LastUpdate: 2006-02-25 01:01

HTML::TableExtract

HTML::TableExtract is a Perl module that
simplifies the extraction of information from
tables within HTML documents. Tables, no matter
how nested or clustered, can be targeted
symbolically with column headers or by more
specific depth and count information.

LastUpdate: 2001-01-30 06:12

FAQ PLAIN

FAQ PLAIN is a simple FAQ preprocessor. It generates a single FAQ output page which can be used for HTML PLAIN, or as an include page. It offers some very useful options, such as a hierarchical structure of the FAQ with automatic numbering. The program is easy to use and greatly simplifies the task of creating an FAQ page.

LastUpdate: 2006-10-06 14:17

hoglet

Hoglet allows special markup to be added to text
documents so that software documentation can be
easily produced. Hoglet provides a configurable
parser, simple markup rules, and extensible "tag
handlers" that allow custom Java code to process
content.

LastUpdate: 2004-03-23 14:01

LaTeX support for NetBeans

LaTeX support for NetBeans is a set of NetBeans
modules supporting easy editing of LaTeX source
files. A complete binary distribution is also
available. The features include code completion,
structure view, a spell checker, and many others.

LastUpdate: 2002-10-15 19:11

pdoc

Pdoc is a Perl library which provides a number of methods to automate
the documentation procedure, parsing specific file formats and
converting them to other formats. It features Perl module parsing
capabilities and conversion to HTML.

LastUpdate: 2011-01-01 15:48

Winnow

Winnow efficiently trains and operates any number of unique Bayesian (Naive Bayes) classifiers on large sets of content. It has very high performance and works with very small and unbalanced training sets. It has been used to power an innovative Web feed reader that uses smart tags, which learn and find the content you want to see, from more sources than you can follow with traditional feed readers. It works particularly well with Ruby and Ruby on Rails.

LastUpdate: 2010-02-11 00:22

rssgen

rssgen is a PHP RSS generator. It does not require
a database, as the information is written directly
to an XML file. You can create new headlines,
modify existing ones, and preview how they will
appear.
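
To illustrate the database-free approach, here is a minimal Python sketch that builds an RSS 2.0 document in memory and serializes it to XML that could be written straight to a file. It is not rssgen's code (rssgen is written in PHP); the function name `build_feed`, the feed contents, and the file name `headlines.xml` are hypothetical.

```python
import xml.etree.ElementTree as ET


def build_feed(title, link, description, items):
    """Return an RSS 2.0 document as UTF-8 encoded bytes.

    (Hypothetical helper for illustration; not part of rssgen.)
    """
    rss = ET.Element("rss", version="2.0")
    channel = ET.SubElement(rss, "channel")
    for tag, text in (("title", title), ("link", link),
                      ("description", description)):
        ET.SubElement(channel, tag).text = text
    # Each headline becomes one <item> element inside the channel.
    for item in items:
        node = ET.SubElement(channel, "item")
        ET.SubElement(node, "title").text = item["title"]
        ET.SubElement(node, "link").text = item["link"]
    return ET.tostring(rss, encoding="utf-8")


xml_bytes = build_feed(
    "Example headlines",
    "http://example.com/",
    "A made-up feed for illustration",
    [{"title": "First headline", "link": "http://example.com/1"}],
)
print(xml_bytes.decode("utf-8"))

# Storing the feed needs only a file, no database, for example:
# with open("headlines.xml", "wb") as fh:
#     fh.write(xml_bytes)
```

Modifying a headline then amounts to parsing the file, changing the matching item, and writing the file back.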

Natural Language: English
Programming Language: PHP
User Interface: Web Environment
LastUpdate: 2010-07-21 12:38

Quh

Quh is an audio player that combines many APIs
into a very simple framework inspired by file
operations. It aims to play everything that makes
noise (including reading different text formats
using speech synthesis).
