MarkGrab Plugin for Claude Code
Universal web content extraction tool that converts any URL to LLM-ready markdown.
Skills
| Skill | Description |
|---|---|
| extract-url | Extract content from a web URL (HTML, YouTube, PDF) |
| extract-file | Convert local PDF/DOCX to markdown |
| batch-extract | Bulk extract from multiple URLs |
Requirements
pip install markgrab # core
pip install "markgrab[all]" # all content types
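
To confirm the install worked before invoking a skill, a quick check like the one below may help. It is a minimal sketch that assumes the importable module name matches the pip package name (`markgrab`); the helper `is_installed` is hypothetical and not part of the plugin.

```python
import importlib.util


def is_installed(module_name: str = "markgrab") -> bool:
    """Return True if a top-level module can be found on the import path.

    Assumption: the import name equals the pip package name ("markgrab").
    """
    return importlib.util.find_spec(module_name) is not None


if __name__ == "__main__":
    # Prints True after a successful `pip install markgrab`, False otherwise.
    print("markgrab available:", is_installed())
```

If this reports `False`, the package is probably installed into a different Python environment than the one Claude Code is using.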
Supported Content Types
- HTML — content density filtering, auto-fallback to Playwright for JS-heavy sites
- YouTube — transcript extraction with timestamps and multi-language support
- PDF — text extraction with page structure
- DOCX — paragraph and heading extraction
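
The four content types above imply some routing step that decides which extractor handles a given input. The sketch below shows one way such routing could look. It is illustrative only: `guess_content_type` and `YOUTUBE_HOSTS` are hypothetical names, and MarkGrab's actual detection logic may differ (for example, it may inspect response headers rather than the URL).

```python
from urllib.parse import urlparse

# Hypothetical host list; an assumption, not taken from MarkGrab itself.
YOUTUBE_HOSTS = {"youtube.com", "www.youtube.com", "m.youtube.com", "youtu.be"}


def guess_content_type(source: str) -> str:
    """Classify a URL or local path as 'youtube', 'pdf', 'docx', or 'html'."""
    parsed = urlparse(source)
    host = (parsed.hostname or "").lower()
    path = parsed.path.lower()

    # Video pages are identified by host, not by file extension.
    if host in YOUTUBE_HOSTS:
        return "youtube"
    # Documents are identified by the extension of the path component.
    if path.endswith(".pdf"):
        return "pdf"
    if path.endswith(".docx"):
        return "docx"
    # Everything else is treated as an HTML page.
    return "html"


if __name__ == "__main__":
    for src in ("https://youtu.be/abc", "https://example.com/paper.pdf", "notes.docx"):
        print(src, "->", guess_content_type(src))
```

Extension-based guessing is cheap but can misclassify URLs that serve a PDF from a path without a `.pdf` suffix, so a real implementation would likely also check the `Content-Type` header.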
Links
<sub>Part of the QuartzUnit ecosystem — composable Python libraries for data collection, extraction, search, and AI agent safety.</sub>