💻 programming

Tabled

Name: Tabled
Brand: Tabled
Price: 免费 CNY
Availability: InStock

Tools to detect and extract tables into Markdown and CSV formats

#automation

#machine learning

#Data extraction

#PDF processing

#Table detection

Try Now

Product Details

Tabled is a Python library for detecting and extracting tables. It uses surya to identify tables in PDFs, identify rows and columns, and be able to format cells into Markdown, CSV, or HTML. This tool is very useful for data scientists and researchers who often need to extract tabular data from PDF documents for further analysis. Key advantages of Tabled include highly accurate table detection and extraction capabilities, support for multiple output formats, and an easy-to-use command line interface. Additionally, it provides an interactive APP that allows users to intuitively try using Tabled on images or PDF files.

Main Features

Detect tables in PDF and identify rows and columns

Format tabular data into Markdown, CSV or HTML

Supports two usage methods: command line tool and interactive APP

Automatically download model weights without additional configuration

JSON file saving options providing additional row and column information

Ability to save debug images showing detected rows, columns and cells

Supports direct calls from Python code for easy integration into larger workflows

How to Use

1. Install Python 3.10+ and PyTorch.

2. Use pip to install Tabled: 'pip install tabled-pdf'.

3. Run Tabled and specify the data path: 'tabled DATA_PATH'.

4. Use the --format option to specify the output format (Markdown, HTML, or CSV).

5. If needed, use the --save_json option to save additional row and column information.

6. Use the --save_debug_images option to save debug images.

7. If the image is already a cropped table, use the --skip_detection option.

8. View the generated 'results.json' file, which contains the extracted tabular data.

Target Users

Tabled's target audience is primarily data scientists, researchers, and developers who need to extract tabular data from PDF documents for data analysis or further processing. This tool is suitable for them as it provides high-accuracy table detection and extraction, supports multiple output formats, and is easy to integrate into existing workflows.

Examples

✓

Researchers use Tabled to extract data from PDFs of academic papers for statistical analysis.

✓

Data scientists use Tabled to convert tabular data in market research reports into CSV format for use in economic forecasting models.

✓

Developers integrate Tabled into their software products to provide automated PDF table data processing capabilities.

Quick Access

Visit Website →

Related Recommendations

Discover more similar quality AI tools

Prisma Optimize

Prisma Optimize is a tool that uses artificial intelligence technology to analyze and optimize database queries. It accelerates applications by providing in-depth insights and actionable recommendations to make database queries more efficient. Prisma Optimize supports a variety of databases, including PostgreSQL, MySQL, SQLite, SQL Server, CockroachDB, PlanetScale, and Supabase, and can be seamlessly integrated into existing technology stacks without the need for large-scale modifications or migrations. The main advantages of the product include improving database performance, reducing query latency, optimizing query patterns, etc. This is a powerful tool for developers and database administrators to help them manage and optimize databases more effectively.

Tabled

Product Details

Main Features

How to Use

Target Users

Examples

Quick Access

Categories

Related Recommendations

Prisma Optimize

Knowledge Table

VARAG

GraphReasoning

AgentRE

magic-html

TAG-Bench

CyberScraper 2077

Triplex

Datalore

Korvus

Crawlee

LAMDA-TALENT

APIGen

DB-GPT

Yayi Information Extraction Large Model