Java PDF Parser to Text and Imagae

Baidu OCR Breaks Long-Document Memory Wall: New Architecture Beats DeepSeek

Open-source OCR from Baidu eliminates the GPU memory wall that limits long-document parsing. Unlimited OCR uses a constant KV ...

IEEE

Harnessing Text Insights With Visual Alignment for Medical Image Segmentation

Abstract: Pre-trained vision-language models (VLMs) and language models (LMs) have recently garnered significant attention due to their remarkable ability to represent textual concepts, opening up new ...

The New York Times

4 Easy Ways to Make Sure No One Can Read Your Text Messages

We independently review everything we recommend. When you buy through our links, we may earn a commission. Learn more› By Max Eddy Max Eddy is a writer who has covered privacy and security — including ...

National Bureau of Economic Research

Measuring Industrial Policy: A Text-Based Approach

Since the 18th century, policymakers have debated the merits of industrial policy (IP). Yet, economists lack basic facts about its use due to measurement challenges. We propose a new approach to IP ...

IEEE

A Generation Algorithm for “Text to Image” Based on Multi-Channel Attention

Abstract: Research on text-to-image has gained significant attention. However, existing methods primarily rely on upsampling convolution operations for feature extraction during the initial image ...

GitHub

Source Parser

This package contains tools for parsing source code into annotated json data structure: we extracted import statements, global assignments, top-level methods, classes, class methods and attributes, ...

GitHub

PDF Parser for AI-ready data. Automate PDF accessibility. Open-source.

🔍 PDF parser for AI data extraction — Extract Markdown, JSON (with bounding boxes), and HTML from any PDF. #1 in benchmarks (0.907 overall). Deterministic local mode + AI hybrid mode for complex ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results