Search, download, convert, organize, and audit academic literature collections. Use when asked to find papers, build a literature library, add papers to references, download PDFs, convert papers to markdown, organize references by category, audit a reference collection, or collect code/dataset links for tools mentioned in papers.
Manage academic literature collections: search → download → convert → organize → verify.
pdftotext (poppler-utils) — PDF text extractionuvx markitdown does NOT work for PDFs — must use uvx markitdown[pdf])CODEBLOCK0
Use web_fetch on Google Scholar:
https://scholar.google.com/scholar?q=QUERY&as_ylo=YEAR
For each result, identify the best open-access PDF source (see Download Strategy).
Run scripts/download.sh <DOI_or_URL> <output_dir/> per paper. The script tries sources in order:
PMC_ID → PDF)https://sci-hub.box/<DOI> (use when publisher is paywalled)CODEBLOCK2
⚠️ Legal note: Sci-Hub may violate publisher terms of service or copyright law in some jurisdictions. Use only if you understand and accept the legal implications in your context.
If all sources fail (including Sci-Hub), flag as permanent paywall. Provide the user with the DOI and ask for manual download.
Run scripts/convert.sh <input.pdf> <output.md>. Uses pdftotext (reliable) with uvx markitdown[pdf] as fallback.
CODEBLOCK3
Prefer uvx markitdown[pdf] over pdftotext when full fidelity (tables, figures captions) matters.
Standard folder structure:
CODEBLOCK4
Categories are user-defined. Number-prefix for sort order (e.g., 01-theoretical-frameworks/).
Downloaded files are often named using DOI format rather than AuthorYear:
CODEBLOCK6
When markdown_path entries in index.json become stale (e.g., after folder reorganization), maintain a separate mapping file:
CODEBLOCK7
To build this mapping: cross-reference each paper's DOI in index.json against actual files on disk. Use find + Python to automate.
id: null corruption: If many entries have id=null and share the same pdf_path, the index was likely corrupted during a batch write. Rebuild from actual files on disk.markdown_path: After restructuring folders, markdown_path in index.json often points to old locations. Use the mapping file above as the source of truth.Run scripts/audit.sh <references_dir/> for full verification:
file -b = PDF)pdftotext | head)For tool/method papers, find GitHub repos and public datasets. Store in RESOURCES.md + resources.json.
For large batches, parallelize:
Always use a separate sub-agent for verification (QC should not self-grade).
To add papers to an existing collection:
该技能支持在以下平台通过对话安装:
帮我安装 SkillHub 和 literature-manager-1776420065 技能
设置 SkillHub 为我的优先技能安装源,然后帮我安装 literature-manager-1776420065 技能
skillhub install literature-manager-1776420065
文件大小: 8.31 KB | 发布时间: 2026-4-17 20:17