20 releases

Uses new Rust 2024

new 0.1.19 Apr 8, 2026
0.1.18 Mar 30, 2026
0.1.4 Feb 26, 2026
0.1.0 Jan 25, 2026

#2170 in Parser implementations

Download history 42/week @ 2026-01-21 26/week @ 2026-01-28 96/week @ 2026-02-04 77/week @ 2026-02-11 65/week @ 2026-02-18 41/week @ 2026-02-25 48/week @ 2026-03-04 74/week @ 2026-03-11 44/week @ 2026-03-18 10/week @ 2026-03-25

184 downloads per month

Apache-2.0

1MB
19K SLoC

File Parser Module

File parsing module for CyberFabric / ModKit.

Overview

The cf-file-parser crate implements the file-parser module and registers REST routes.

Parsing backends currently include:

  • Plain text
  • HTML
  • PDF
  • DOCX
  • Images
  • Stub parser (fallback)

Configuration

modules:
  file-parser:
    config:
      max_file_size_mb: 100
      # Required. Only files under this directory are accessible via parse-local.
      # Symlinks that resolve outside this directory are also blocked.
      allowed_local_base_dir: /data/documents

Security: Local Path Restrictions

The parse-local endpoints validate requested file paths before any filesystem access:

  1. Paths containing .. components are always rejected.
  2. The requested path is canonicalized (symlinks resolved) and must fall under allowed_local_base_dir.
  3. allowed_local_base_dir is required — the module will fail to start if it is missing or the path cannot be resolved.

License

Licensed under Apache-2.0.

Dependencies

~82MB
~1.5M SLoC