all options
buster  ] [  bullseye  ] [  bookworm  ] [  trixie  ] [  sid  ]
[ Source: htmlcxx  ]

Package: libhtmlcxx3v5 (0.87-4 and others)

Links for libhtmlcxx3v5

Screenshot

Debian Resources:

Download Source Package htmlcxx:

Maintainers:

External Resources:

Similar packages:

simple HTML parser library for C++

htmlcxx is a simple non-validating CSS1 and HTML parser for C++. Although there are several other html parsers available, htmlcxx has some characteristics that make it unique:

 * STL like navigation of DOM tree, using excellent tree.hh library from
   Kasper Peeters
 * It is possible to reproduce exactly, character by character, the original
   document from the parse tree
 * Bundled CSS parser
 * Optional parsing of attributes
 * C++ code that looks like C++ (not so true anymore)
 * Offsets of tags/elements in the original document are stored in the nodes
   of the DOM tree

The parsing politics of htmlcxx were created trying to mimic Mozilla Firefox (http://www.mozilla.org) behavior. So you should expect parse trees similar to those create by Firefox. However, differently from Firefox, htmlcxx does not insert non-existent stuff in your html. Therefore, serializing the DOM tree gives exactly the same bytes contained in the original HTML document.

Other Packages Related to libhtmlcxx3v5

  • depends
  • recommends
  • suggests
  • enhances

Download libhtmlcxx3v5

Download for all available architectures
Architecture Version Package Size Installed Size Files
amd64 0.87-4+b1 31.7 kB115.0 kB [list of files]
arm64 0.87-4+b1 28.7 kB151.0 kB [list of files]
armel 0.87-4+b1 26.2 kB86.0 kB [list of files]
armhf 0.87-4+b1 26.5 kB74.0 kB [list of files]
i386 0.87-4+b1 33.1 kB102.0 kB [list of files]
mips64el 0.87-4+b1 29.0 kB155.0 kB [list of files]
ppc64el 0.87-4+b1 31.9 kB151.0 kB [list of files]
riscv64 0.87-4+b1 29.8 kB95.0 kB [list of files]
s390x 0.87-4+b1 30.1 kB111.0 kB [list of files]