Fork me on GitHub

HPar

A Parallel HTML5 Parser based on Jsoup

Download .jar Download Source .zip Download Source .tar.gz

Parallel - Better Real Time

On Linux, Intel® Core™ i7-4710MQ CPU @ 2.50GHz × 8, HPar achieves speedups as high as 2.3x.

Enjoy it

The structure parser return is compatible to Jsoup 1.9.1. All you need to do is include this jar file and Jsoup 1.9.1, then enjoy it.

More information please check out at: https://github.com/sunxfancy/HPar


String data = "<html></html>";

// a new paralleled Jsoup with 8 threads
ParallelJsoup pj = new ParallelJsoup(data, 8);

// parsing the HTML data
Document document = pj.parse();
                        

Licences - MIT