Name	Name	Last commit message	Last commit date
parent directory ..
README.md	README.md
SKILL.md	SKILL.md

Name

Last commit message

Last commit date

/product-spec-pdf-parser

PDF product spec parser for Claude Code. Feed it price books, fact sheets, or spec sheets — get structured FF&E data written to your master Google Sheet.

Dependencies

PyMuPDF — PDF text extraction
```
pip install PyMuPDF
```

Usage

/product-spec-pdf-parser

Then provide PDF paths — point to individual files or a folder.

/product-spec-pdf-parser ~/Documents/specs/alphabeta-floor-lamp.pdf

Or a folder:

/product-spec-pdf-parser ~/Documents/specs/

Variant depth

expand (default) — one row per variant/SKU. Best for procurement.
summarize — one row per product, variants comma-separated. Best for quick reference.

Output

Appends rows to the master Google Sheet using the 33-column schema. PDF-specific data (variant, price adder, country of origin, source filename) is stored in the Notes column. Can also output to local CSV or markdown.

How it fits

This is a utility — it can be called standalone or as part of a larger workflow:

Context	How it's used
Standalone	Designer has spec sheets or catalogs to process
`/product-research`	Designer drops a PDF from a rep into the conversation
Product & Materials Researcher agent	Agent delegates to this skill for PDF-based spec extraction

Output Schema

33 columns matching the master schema. Key fields populated:

Field	Example
Product Name	Alphabeta Floor Lamp
SKU	HEM-AF-DB
Brand	Hem
Designer	Luca Nichetto
Category	Light
Materials	Aluminium, Steel
List Price	595.00 EUR
Notes	Variant: Diamond, Black \| Origin: Sweden \| Source: alphabeta-fact-sheet.pdf
Source	`pdf-parser`

PDF Types Supported

Type	Variant strategy
Fact sheet with SKUs	One row per SKU (shade × color)
Fact sheet with finishes	One row per upholstery option
Price book / configurator	One row per product type, options summarized
Product catalog	Rows for each distinct product
Spec sheet	One row with full detail

Error Handling

Scanned/image PDFs — detected and flagged for OCR
Password-protected PDFs — caught and reported
Large PDFs (100+ pages) — processed in 10-page chunks with progress updates

After every batch: Parsed: X products from Y PDF(s)

Works with

Skill	Relationship
`/product-research`	Designer drops a PDF during research, this extracts the data
`/product-data-cleanup`	Run after parsing to normalize the sheet
`/product-image-processor`	Run after parsing to process product images

License

MIT

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

/product-spec-pdf-parser

Dependencies

Usage

Variant depth

Output

How it fits

Output Schema

PDF Types Supported

Error Handling

Works with

License

FilesExpand file tree

product-spec-pdf-parser

Directory actions

More options

Directory actions

More options

Latest commit

History

product-spec-pdf-parser

Folders and files

parent directory

README.md

/product-spec-pdf-parser

Dependencies

Usage

Variant depth

Output

How it fits

Output Schema

PDF Types Supported

Error Handling

Works with

License