全部搜尋項
buster  ] [  bullseye  ] [  bookworm  ] [  trixie  ] [  sid  ]
[ 原始碼: tagsoup  ]

套件:libtagsoup-java(1.2.1+-1.1)

libtagsoup-java 的相關連結

Screenshot

Debian 的資源:

下載原始碼套件 tagsoup

維護小組:

外部的資源:

相似套件:

SAX-compliant parser for real-life HTML

TagSoup, a SAX-compliant parser written in Java that, instead of parsing well-formed or valid XML, parses HTML as it is found in the wild: poor, nasty and brutish, though quite often far from short. TagSoup is designed for people who have to process this stuff using some semblance of a rational application design.

By providing a SAX interface, it allows standard XML tools to be applied to even the worst HTML. TagSoup also includes a command-line processor that reads HTML files and can generate either clean HTML or well-formed XML that is a close approximation to XHTML.

TagSoup is designed as a parser, not a whole application; it isn't intended to permanently clean up bad HTML, as HTML Tidy does, only to parse it on the fly. Therefore, it does not convert presentation HTML to CSS or anything similar. It does guarantee well-structured results: tags will wind up properly nested, default attributes will appear appropriately, and so on.

標籤: 實做語言: Java, 支援的格式: HTML, 超本文標記語言, works-with-format::xml, works-with::text

其他與 libtagsoup-java 有關的套件

  • 依賴
  • 推薦
  • 建議
  • 增強

下載 libtagsoup-java

下載可用於所有硬體架構的
硬體架構 套件大小 安裝後大小 檔案
all 99。5 kB128。0 kB [檔案列表]