Showing 22 open source projects for "java html parser"

View related business solutions
  • Tremendous is the global payouts platform for businesses sending gift cards and money at scale. Icon
    Tremendous is the global payouts platform for businesses sending gift cards and money at scale.

    Getting started is simple: add a funding method and place your first order in minutes.

    Trusted by 20,000+ leading organizations, Tremendous has delivered billions of rewards and enables businesses to reach recipients across 230+ countries and regions. Recipients have 2,500+ payout options to choose from, including gift cards, prepaid cards, cash transfers, and charitable donations.
    Learn More
  • Download the most trusted enterprise browser Icon
    Download the most trusted enterprise browser

    Chrome Enterprise brings enterprise controls and easy integrations to the browser users already know and love.

    Chrome Enterprise is ideal for businesses of all sizes, IT professionals, and organizations looking for a secure, scalable, and easily managed browser solution that supports remote work, data protection, and streamlined enterprise operations.
    Learn More
  • 1
    TXM

    TXM

    Unicode XML TEI text analysis platform

    TXM is a free and open-source cross-platform Unicode & XML based text analysis environment and graphical client, supporting Windows, Linux and Mac OS X. It can also be used online as a J2EE standard compliant web portal (GWT based) with access control built in. DOWNLOAD LATEST VERSION OF TXM : http://textometrie.ens-lyon.fr/spip.php?rubrique61&lang=en TXM offers a comprehensive range of analysis tools (concordances, collocate search, frequency lists, etc.) based on the powerfull CQP...
    Leader badge
    Downloads: 6 This Week
    Last Update:
    See Project
  • 2
    FastQC

    FastQC

    A quality control analysis tool for high throughput sequencing data

    FastQC is a quality control analysis tool designed to spot potential problems in high throughput sequencing datasets. Its goal is to provide a simple way by which to check the quality of raw sequence data coming from high throughput sequencing pipelines. It does this by running a modular set of analyses on one or more raw sequence files in fastq or bam format. It then produces a report summarizing the results, and highlighting any areas where the library may appear unusual. This should then...
    Downloads: 28 This Week
    Last Update:
    See Project
  • 3

    Ghawwas_V4

    An open source system for Arabic corpora processing

    Ghawwas (previously known as Khawas) is an open source system for Arabic corpora processing. Ghawwas V4.0 provides the following main functions: a. Frequency list for single word and N-Grams b. Concordance c. Collocation (MI, CHI Squared, LL, T-Score, Z Score, Dice, Log Dice, Weirdness Coefficient) d. Lexical patterns search e. Two corpora frequency profile comparison based on MI, CHI, LL, T-Score, Z Score, Dice, Log Dice, Weirdness Coefficient f. Accept Windows and UTF-8 character...
    Downloads: 14 This Week
    Last Update:
    See Project
  • 4
    Universal Tag Finder

    Universal Tag Finder

    Is tool for query HTML content

    Universal Tag Finder is a tool to query against html content in the file system. For developers it allows to find out relevant elements during the troubleshooting and validating periods. Does not need to bother on massive regex to find out elements.
    Downloads: 1 This Week
    Last Update:
    See Project
  • Dominate AI Search Results Icon
    Dominate AI Search Results

    Generative Al is shaping brand discovery. AthenaHQ ensures your brand leads the conversation.

    AthenaHQ is a cutting-edge platform for Generative Engine Optimization (GEO), designed to help brands optimize their visibility and performance across AI-driven search platforms like ChatGPT, Google AI, and more.
    Learn More
  • 5
    Simple-Scrape is a simple web-scraping library that allows for programmatic access to HTML code. No further techniques are needed and the library is very compact and thus easy to use.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6

    P-VCD

    Content-Based Video Copy Detection

    Software for Video Copy Detection. The software has been described in some scientific papers, e.g. http://dx.doi.org/10.1109/ICME.2011.6012212 and http://dx.doi.org/10.1007/s11042-011-0915-x. This software is the result of my PhD at the University of Chile and the participation at TRECVID Content-Based Copy Detection (CCD) evaluation task 2010 and 2011. More details in http://www-nlpir.nist.gov/projects/tv2011/tv2011.html#ccd and in...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7

    HadStat

    HadStat is service on cloud,for data analysis using Hadoop MapReduce.

    HadStat is service on the cloud, allow you to analysis the data on the cloud and return the result in nice graph,this service is free, you can redistribute it and/or modify it under the terms of the GNU General Public License. this service using many technologies , like Hadoop mapreduce, HTML, PHP, Web Service applications, linux server, java, eclipse IDE, with many indicators:Simple moving average (SMA),Exponential moving average (EMA),Smoothed simple moving average (SMMA),Linear weighted moving average (LWMA )on DATA from NYSE daily prices.....
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8

    ssSearchEngine

    keyword search engine for semi-structured data (Tables, lists,...)

    This application implement an approach for doing keyword based search over semi-structured data available in HTML pages.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Optex Analyzer is a software to analyze and compare algorithms to solve approximately optimization problems. It has a GUI that allows select a set of input files containing raw algorithm results. The analysis is shown with tables and charts.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Self-hosted password manager Icon
    Self-hosted password manager

    Developed and headquartered in Europe (Barcelona, Spain), Passwork meets GDPR, NIS2, ENS and other European regulatory requirements by design.

    On-premise solution with double encryption and certified development processes for maximum protection of corporate data. Zero‑knowledge architecture ensures your passwords never leave your infrastructure.
    Learn More
  • 10
    * Java classes for parsing text, conversion to XML or to evaluate in Java. The parser is textual-script-controlled with a syntax near Backus Naur Format, named ZBNF. * Some routines for conversion: C-Header or Java to XMI, XML-Documentation generation,
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    OpenDMAP (Open Source Direct Memory Access Parser) is a natural language processing (text mining) application: a semantic parser for information extraction.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Open extensible system analysis report tool for Java, based on numerous open source analysis initiatives. The XML/XSL batch-processing framework produces integrated HTML/SVG reports of the systems current state and the development over time.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    A bioinformatics package to analyse ESTs (Expressed Sequence Tags) easily. It contains functionalities of ESTs for EST submission, Batch BLAST and BLAST result parser, etc. easiEST contains JAVA GUI applications for analysing ESTs or other sequences
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Provides a GUI interface to grammatical structure and relations (as parsed by the Stanford Parser) of any text. Contains grammatical relation editor to modify, import, export grammatical relation definitions (tregex patterns and features).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Automatic generation of documentation on Delphi projects from source code. Distinctive features are exact parsing gathering lots of information and a division of the parser and configurable generators (HTML, Win- & HTML-Help, PDF, LaTeX, XMI export)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Docgen plugin for Protege provides a quick export of all the content of an ontology (classes, instances and documentation) in various formats (html, pdf, fo...). Images, Graphs, URL are readyly included in reports.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    This program helps understanding the content of any MPEG compressed video stream by displaying its structure and content in a tree based view.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    The aim of MIEX (Metadata and Information Extractor from small XML documents) is to create a wrapper for the Stanford Parser, to extract and store metadata (syntactic structures, relationships among words...) from simple XML documents.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 19
    TagHybrida is a French hybrid syntactic parser. TagHybrida is a four stage parser combining hand-writen and corpus based information.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    XML Parser, Data Processing API and Specific Implementations for SensorML.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    HyperSpider (Java app) collects the link structure of a website. Data import/export from/to database and CSV-files. Export to Graphviz DOT, Resource Description Framework (RDF/DC), XML Topic Maps (XTM), Prolog, HTML. Visualization as hierarchy and map.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    OBIEE UDML (Universal Database Markup Language) parser, providing an XML output, enabling other applications to take advantage of the meta data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
MongoDB Logo MongoDB