Showing 370 open source projects for "java html parser"

View related business solutions
  • Supercharge Your Manufacturing with Easy MRP and MES Software Icon
    Supercharge Your Manufacturing with Easy MRP and MES Software

    Designed for SME manufacturers who want to reduce wasteful manual processing, save time and increase profits.

    Flowlens eliminates stock-outs, shortage and overstocks, avoiding costly production delays. Stay in control of inventory levels and keep production running smoothly with real-time visibility and easy-to-use stock management. Import bulk data with ease.
    Learn More
  • Multi-Entity Cloud Accounting Software for Growing Businesses Icon
    Multi-Entity Cloud Accounting Software for Growing Businesses

    Built for small to midsize businesses that have outgrown entry-level accounting or legacy ERP solutions.

    Built natively on the Microsoft Power Platform (Dynamics 365), Gravity delivers robust multi-entity financial management with seamless integration to Microsoft 365, Power BI, Teams + Copilot — no third-party add-ons required.
    Learn More
  • 1
    Ace Operator is a tool for enabling live communications between company representatives and online users from web sites. By clicking on the "live help" button, customers can get access to company representatives and exchange information in real-time.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    crawler4j

    crawler4j

    Open source web crawler for Java

    crawler4j is an open source web crawler for Java which provides a simple interface for crawling the Web. Using it, you can setup a multi-threaded web crawler in few minutes. You need to create a crawler class that extends WebCrawler. This class decides which URLs should be crawled and handles the downloaded page. shouldVisit function decides whether the given URL should be crawled or not.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Web Widget Toolkit (WTK): Server-side components for easily creating web-based user interfaces with complex navigation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    JuniCoder is a Java project that uses unicode as a base for decoding and encoding formats that invented workarounds to express characters not covered by ASCII. Decoders translate those inventions to unicode. Encoders encode to these inventions.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Track time for payroll, billing and productivity Icon
    Track time for payroll, billing and productivity

    Flexible time and billing software that enables teams to easily track time and expenses for payroll, projects, and client billing.

    Because time is money, and we understand how challenging it can be to keep track of employee hours. The constant reminder to log timesheets so your business can increase billables, run an accurate payroll and remove the guesswork from project estimates – we get it.
    Learn More
  • 5
    cpDetector is a proxy for codepage detection of documents. It delegates to multiple instances that try to detect the codepage by different techinques. A command line executeable is shipped that allows to sort documents by codepage.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 6
    Lanproxy

    Lanproxy

    Intranet penetration tool that proxies local area network computers

    Lanproxy is an intranet penetration tool that proxies local area network personal computers and servers to the public network. It supports tcp traffic forwarding and any tcp upper layer protocol (access to intranet websites, local payment interface debugging, ssh access, remote desktop, http proxy) , https proxy, socks5 proxy...). Penetration basic functions, same as the open source version, high performance, can support tens of thousands of penetration connections at the same time. Support...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 7
    The MangaStream Downloader is an open source application written in Java for managing and downloading manga from the site mangastream.com and mangafox.me. It is written under the GNU-GPL license and uses an open source HTML parser - TagSoup. Follow the project page on Facebook for updates: https://www.facebook.com/MangastreamDownloader
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    Gecco

    Gecco

    Lightweight Java web crawler framework with jQuery-style extraction

    Gecco is a lightweight web crawler framework written in Java that simplifies the process of building web scraping applications. It is designed to make crawler development straightforward by allowing developers to extract page elements using jQuery-style selectors rather than complex parsing logic. It integrates several well-known Java libraries and frameworks, including tools for HTTP requests, HTML parsing, JSON processing, and application development.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9

    webmongo

    Accessing server-side mongodb through client javascript API.

    Accessing server-side mongodb through client javascript API. This project is a branch of dbcloud You can do almost invoke on mongodb through the javascript API in browser. The client javascript api support IE6.0+ Chrome FireFox and Wechat
    Downloads: 1 This Week
    Last Update:
    See Project
  • Accounting practice management software Icon
    Accounting practice management software

    Accountants, accounting firms, tax attorneys, tax professionals

    Canopy is a cloud-based practice management software for accounting and tax firms, offering tools for client engagement, document management, workflow automation, and time & billing. Its Client Engagement platform centralizes interactions with a secure portal, customizable branding, and email integration, while the Document Management system enables organized, paperless file storage. The Workflow module enhances visibility into tasks and projects through templates, task assignments, and automation, reducing human error. Additionally, the Time & Billing feature tracks billable hours, generates invoices, and processes payments, ensuring accurate financial management. With its comprehensive features, Canopy streamlines operations, reduces stress, and enhances client experiences.
    Learn More
  • 10

    Java-WebTTS

    API-Makes static web pages readable with no coding

    This JAVA API helps create java web applications where static html pages can be read out to viewer. It helps people who are visually challenged partially. It is helpful to common people and children , too, and might work out great in educational site. It's fully customized and does not need developer to write a single line of code . All you need to do is to assign a specific id to the DOM element , whose innerHTML you want to be read out.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    Save For Offline

    Save For Offline

    Android app for saving webpages for offline reading

    Android app for saving webpages for offline reading. Save For Offline is an Android app for saving full web pages for offline reading, with lots of features and options. In you web browser selects 'Share', and then 'Save For Offline'. Saves real HTML files which can be opened in other apps/devices. Download & save entire web pages with all assets for offline reading & viewing. Save HTML files in a custom directory. Save in the background, no need to wait for it to finish saving. Night mode,...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    Simple-Scrape is a simple web-scraping library that allows for programmatic access to HTML code. No further techniques are needed and the library is very compact and thus easy to use.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13

    eXtensible Text Framework (XTF)

    Framework for search and display of heterogenous document collections.

    NOTICE: This code repository is deprecated. Please visit https://github.com/cdlib/xtf for the latest updates. Obsolete Description: The eXtensible Text Framework (XTF) is an architecture that supports searching across collections of heterogeneous textual data (XML, PDF, HTML, text, and more), and the presentation of results and documents in a highly configurable manner. Includes highly customized versions of the proven open-source components Lucene and Saxon.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 14

    filtrando google sheets

    Aplicación que permite filtrar hojas de cálculo de google

    Aplicación que permite conectarse a una hoja de cálculo de google y filtrar sus datos, mediante una fórmula de filtrado. Es un poco complejo utilizarla, pero dispone de documentación detallada sobre cómo funciona paso a paso. You will find more about it at this web site: https://www.frojasg1.com:8443/downloads_web/web/html/googleSheets.html?origin=sourceforge
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    ItsNat is a Component based AJAX Java Web Application Framework.No XML programming,no mixed view/code,no custom JavaScript.Only pure HTML,pure Java and server centric Swing-like programming with W3C standards where "The Browser is The Server"
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Jericho HTML Parser is a java library allowing analysis and manipulation of parts of an HTML document, including server-side tags, while reproducing verbatim any unrecognised or invalid HTML.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 17
    Chunk, an HTML Template Engine for Java

    Chunk, an HTML Template Engine for Java

    Clean, powerful templates for Java

    A powerful Java Template Engine, great for building HTML or XML docs. Chunk can handle many other needs and situations as well. In-tag filters & default values, multiple snippets per file, layered themes, macros, conditional includes, localization & more.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    CleanCode

    CleanCode

    .NET, PowerShell, SQL, Java, Perl, and Javascript developer libraries

    Develop clean code with our .NET components (plus PowerShell, SQL, Java, Perl, and JavaScript components as well!). CleanCode highlights include user controls, a validation engine, a diagnostic system, an XML/HTML pre-processor, and a variety of articles on code design.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 19

    Extra Life Donation Parser

    Parses donations from EL and displays them

    Intended to be used during a live stream, this parser will poll for new donations based on a comma-delimited list of ids and display them in a frameless background with a total. Colors can be changed without closing the display to support streamers using a chroma key. Licensed under GPL version 3
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Jigy Generator Linux

    Jigy Generator Linux

    The best way to generate a Spring Java Web Application. It just works!

    The Jigy Generator is the simplest way to build and code a Java web application with a Spring Framework by using a project creation wizard and now it's been made even simpler by downloading a very small HTML file (337 bytes) and filling out a few form fields and let it construct your project with ease! The resulting project will be a Java web application with a Spring Framework configured to run in Eclipse or Netbeans.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Jigy Generator OSX

    Jigy Generator OSX

    The best way to generate a Spring Java Web Application. It just works!

    The Jigy Generator is the simplest way to build and code a Java web application with a Spring Framework by using a project creation wizard and now it's been made even simpler by downloading a very small HTML file (337 bytes) and filling out a few form fields and let it construct your project with ease! The resulting project will be a Java web application with a Spring Framework configured to run in Eclipse or Netbeans.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    NekoHTML is a simple HTML scanner and tag balancer that enables application programmers to parse HTML documents and access the information using standard XML interfaces.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23

    wikihtml

    Converts wikitext documents into HTML documents

    This project is an application that converts wikitext documents into HTML documents. Wiki markup or wikitext is a markup language to write documents in wiki-based systems, such as web sites powered by MediaWiki.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    xowa

    xowa

    A free, open-source, offline Wikipedia application

    XOWA is a desktop application for reading and editing Wikipedia offline (XOWA has moved to http://gnosygnu.github.io/xowa/download.html)
    Downloads: 4 This Week
    Last Update:
    See Project
  • 25
    Aspose Java for Liferay

    Aspose Java for Liferay

    Provides export options for blogs, journals and dynamic lists

    This is Liferay CMS / Portal plugin released by Aspose pty ltd. Aspose.Total Java for Liferay (hook plugin app) provides options for exporting web-contents and blogs created in html to MS-WORD, MS-EXCEL and PDF file formats using Aspose.Total Java APIs. (Aspose.Words, Aspose.Cells and Aspose.PDF) The Plugin also provides very useful functionality / options for exporting the Dynamic Data Lists to MS-WORD, MS-EXCEL and PDF file formats using Aspose.Total Java APIs. ...
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB