Find a file
2012-09-14 11:22:21 +01:00
bin Useful performance testing example 2009-05-01 19:58:12 +01:00
doc updated doxygen config 2010-01-01 23:03:56 +00:00
examples Add PARSER_LIBS into link 2011-11-25 18:47:54 +00:00
include Add this-> 2012-09-14 11:22:21 +01:00
m4 Fixed includes for DOMException - all this time they'd been wrong :( 2011-11-26 09:19:56 +00:00
src Trying to get entity resolution going in Libxml2 2011-12-16 17:40:06 +00:00
tests Updated taggle test for new Writer 2011-11-25 21:45:55 +00:00
vs7 Added ARABICA_ prefix to parser selection macros 2012-09-14 11:11:11 +01:00
vs8 Added back vs7 and vs8 dirs, with explanatory readmes 2010-10-22 20:05:00 +01:00
vs9 Added -D NO_BOOST to release config 2012-05-22 16:27:41 +01:00
vs10 ignore .o 2012-09-14 11:13:13 +01:00
.bzrignore Setup DOM conformance files path so we can find them when running under hudson 2010-12-15 20:10:49 +00:00
.gitignore ignore .o 2012-09-14 11:13:13 +01:00
arabica.pc.in install headers into arabica subdirectory 2009-11-21 22:08:41 +00:00
AUTHORS sept 2006 release 2006-09-12 21:21:48 +00:00
autogen.sh sept 2006 release 2006-09-12 21:21:48 +00:00
ChangeLog Updated to point at loggerhead browser 2010-01-13 23:15:36 +00:00
config.guess updated libtool 2009-07-31 17:46:11 +01:00
config.sub updated libtool 2009-07-31 17:46:11 +01:00
configure.ac Massage ArabicaConfig.hpp so all macros are prefixed ARABICA_ 2011-11-25 23:26:39 +00:00
COPYING minor text files update 2008-04-07 21:13:28 +00:00
depcomp sept 2006 release 2006-09-12 21:21:48 +00:00
INSTALL sept 2006 release 2006-09-12 21:21:48 +00:00
install-sh updated libtool 2009-07-31 17:46:11 +01:00
ltmain.sh Fixed includes for DOMException - all this time they'd been wrong :( 2011-11-26 09:19:56 +00:00
Makefile.am Include DOM conformance tests in dist package 2010-12-09 14:50:38 +00:00
missing sept 2006 release 2006-09-12 21:21:48 +00:00
NEWS minor text files update 2008-04-07 21:13:28 +00:00
README Renamed 2012-09-07 19:35:41 +01:00

Arabica
=======

*Arabica is an XML and HTML processing toolkit*, providing *SAX2*, *DOM*, *XPath*, and *XSLT* implementations, written in *Standard C++*

* *SAX* is an event-based XML processing API. Arabica is a full SAX2 implementation, including the optional interfaces and helper classes. It provides uniform SAX2 wrappers for the Expat parser, Xerces, Libxml2 and, on Windows, for the Microsoft XML parser.
* The *DOM* is a platform- and language-neutral interface which models an XML document as a tree of nodes, defined by the W3C. Arabica implements the DOM Level 2 Core on top of the SAX layer.
* *XPath* is a language for addressing parts of an XML document. Arabica implements XPath 1.0 over its DOM implementation.
* *XSLT* is a language for transforming XML documents into other XML documents. Arabica builds XSLT over its XPath engine.
* In addition to the XML parser, Arabica includes Taggle, an *HTML parser* derived from TagSoup. 

Arabica is written in Standard C++ and should be portable to most platforms. It is parameterised on string type. Out of the box, it can provide UTF-8 encoded std::strings or UTF-16 encoded std::wstrings, but can easily be customised for arbitrary string types.

Arabica is available under a BSD-style license.

For latest Arabica news, please see http://www.jezuk.co.uk/arabica
For build notes see http://www.jezuk.co.uk/arabica/howtobuild