Earn 14,000 ($140.00)
Convert Java library to JavaScript/TypeScript (for use in NodeJS)
Bounty Description
Problem Description
Google Boilerpipe library is currently a Java library
https://code.google.com/archive/p/boilerpipe/
Description of the library: "The boilerpipe library provides algorithms to detect and remove the surplus "clutter" (boilerplate, templates) around the main textual content of a web page."
We need this Boilerpipe library converted from Java to a JavaScript/TypeScript library so that it can be used within a NodeJS application as an imported node module.
NOTE: There already exists a JavaScript version of Boilerpipe but it's simply a wrapper around the Java version of Boilerpipe and requires installation of Java to use. https://www.npmjs.com/package/boilerpipe
We do not want this version of the library because it requires installation of Java. You can use this library for syntax ideas, of course.
This new JavaScript/TypeScript version of Boilerpipe should not use Java in any way. It should be able to be run in a NodeJS application without having Java installed.
Acceptance Criteria
- The final library should be written in JavaScript/TypeScript (and should not require Java to be installed).
- The final library should be able to be imported as a node module in a NodeJS project.
- The final library should accurately and correctly reproduce all the functionality of the original Boilerpipe library. Every function and class in the original library should be available in the new JavaScript/Typescript version of the library.
- As part of the acceptance criteria, we'll test the final library's output versus the original Boilerpipe library to ensure the new library returns the same results as the original. Please ensure they return the same results on multiple websites' html.
Please feel free to ask any clarifying questions.