jsoup

jsoup Java HTML Parser
Developer(s)Jonathan Hedley
Stable release
1.18.1 / July 10, 2024; 22 days ago (2024-07-10)[1]
Repository
Written inJava
Operating systemCross-platform
PlatformJava (JVM)
TypeHTML parser
LicenseMIT license
Websitejsoup.org

jsoup is an open-source Java library designed to parse, extract, and manipulate data stored in HTML documents.

History

[edit]

jsoup was created in 2009 by Jonathan Hedley. It is distributed it under the MIT License, a permissive free software license similar to the Creative Commons attribution license.

Hedley's avowed intention in writing jsoup was "to deal with all varieties of HTML found in the wild; from pristine and validating, to invalid tag-soup."

Projects powered by jsoup

[edit]

jsoup is used in a number of current projects,[2] including Google's OpenRefine data-wrangling tool.

See also

[edit]

References

[edit]
  1. ^ "jsoup Java HTML Parser release 1.18.1". Retrieved 10 Jul 2024.
  2. ^ "Jsoup". MVNRepository / F. Rodriguez. 2015-03-08.
[edit]