Skip to content

Commit c74c7b7

Browse files
GiaoLeeclaude
andcommitted
feat: update skill-auditor scoring rubric, evaluate script, and SKILL metadata
- Revamp scoring_rubric.md with expanded criteria and section weights - Enhance evaluate_skill.py with improved evaluation logic - Update skill-auditor SKILL.md with latest capabilities - Refresh report_json_schema.md, scientific_veto.md, and academic writing evaluation references - Minor metadata updates across multiple SKILL.md files in awesome-med-research-skills and scientific-skills Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
1 parent f58bf43 commit c74c7b7

21 files changed

Lines changed: 269 additions & 139 deletions

File tree

awesome-med-research-skills/Data Analysis/differential-expression-analysis/SKILL.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
---
22
name: differential-expression-analysis
3-
description: Use when analyzing bulk RNA-seq or microarray expression data to identify differentially expressed genes between two biological groups (case vs control), with volcano plots and heatmap visualization. NOT for: single-cell RNA-seq, methylation analysis, non-expression data.
3+
description: Use when analyzing bulk RNA-seq or microarray expression data to identify differentially expressed genes between two biological groups (case vs control), with volcano plots and heatmap visualization. NOT for:single-cell RNA-seq, methylation analysis, non-expression data.
44
license: MIT
55
author: AIPOCH
66
---

awesome-med-research-skills/Evidence Insight/figure-first-paper-reader/SKILL.md

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
---
22
name: figure-first-paper-reader
3-
description: Reads a paper figure by figure before re-integrating the full narrative, so the user can identify the core findings quickly and check whether each visual actually supports the authors' main claims. Always separate figure content, figure-linked claim, evidentiary strength, and unsupported interpretation. Never fabricate references, PMIDs, DOIs, figure content, panel labels, result values, or study details that were not actually provided.
3+
description: "Reads a paper figure by figure before re-integrating the full narrative, so the user can identify the core findings quickly and check whether each visual actually supports the authors' main claims. Always separate figure content, figure-linked claim, evidentiary strength, and unsupported interpretation. Never fabricate references, PMIDs, DOIs, figure content, panel labels, result values, or study details that were not actually provided."
44
license: MIT
55
author: AIPOCH
66
---
@@ -270,3 +270,4 @@ A strong output from this skill should:
270270
- and leave the user with a clear sense of whether the paper's story still holds after a figure-first audit.
271271

272272
A weak output merely paraphrases captions, repeats the abstract, or praises figures without checking whether they actually support the claims.
273+

awesome-med-research-skills/Evidence Insight/litbase/SKILL.md

Lines changed: 2 additions & 14 deletions
Original file line numberDiff line numberDiff line change
@@ -3,20 +3,6 @@ name: litbase
33
description: "Academic paper reading and research development system for biomedical researchers. Finds papers via Semantic Scholar, reads with structured notes, tracks discussion insights, and synthesizes literature into a Research Foundation Document (RFD) for downstream protocol design skills. 8 commands: /setup /feed /read /discuss /recap /update /sync /propose"
44
license: MIT
55
author: AIPOCH
6-
metadata:
7-
openclaw:
8-
optional_bins:
9-
- python # optional: accelerates paper search if available; falls back to inline WebFetch calls
10-
- pdftotext # optional: accelerates PDF text extraction if available; falls back to Claude native PDF reading
11-
capability_tiers:
12-
tier_a: "Conversation mode — Web Claude or any LLM chat interface; no file system; notes output as Artifacts; state maintained via session card"
13-
tier_b: "File mode — Manus or any file-capable agent; file read/write available; no Python/bash required"
14-
tier_c: "Full mode — OpenClaw / Claude Code; full file system, bash, and optional Python acceleration"
15-
downstream_skills:
16-
- clinical-cohort-protocol-designer
17-
- translational-study-blueprint
18-
- statistical-analysis-plan-writer
19-
- protocol-writer
206
---
217
> **Source**: [https://github.com/aipoch/medical-research-skills](https://github.com/aipoch/medical-research-skills)
228
@@ -163,3 +149,5 @@ data_dir/ ← path set in config.json
163149
proposal/
164150
YYYY-MM-DD_RFD.md
165151
```
152+
153+

awesome-med-research-skills/Evidence Insight/medical-research-literature-reader-pro/SKILL.md

Lines changed: 4 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -1,9 +1,8 @@
11
---
22
name: medical-research-literature-reader-pro
3-
description: A medical-research-native literature reading skill for users with clinical, bioinformatics, translational, and basic experimental backgrounds. Use this skill whenever a user wants to read, analyze, critique, or interpret a medical or scientific paper — whether they provide a PDF, abstract, DOI, PMID, or just a title. Triggers include requests like "analyze this paper", "critique this study", "is this a strong paper?", "give me similar studies", "prepare me for journal club", "help me understand this bioinformatics paper", "what are the weaknesses here?", or "turn this into a mind map". Also activate for any downstream deliverables such as journal club kits, comparison tables, PI decision briefs, replication starters, or follow-up experiment designs. Do NOT treat as a generic summarizer — this skill performs structured evidence-type classification, track-specific critical appraisal, interpretation-boundary judgment, and research-grade follow-up generation.
4-
version: 1.0.0
5-
author: AIPOCH
3+
description: "A medical-research-native literature reading skill for users with clinical, bioinformatics, translational, and basic experimental backgrounds. Use this skill whenever a user wants to read, analyze, critique, or interpret a medical or scientific paper — whether they provide a PDF, abstract, DOI, PMID, or just a title. Triggers include requests like \\\"analyze this paper\\\", \\\"critique this study\\\", \\\"is this a strong paper?\\\", \\\"give me similar studies\\\", \\\"prepare me for journal club\\\", \\\"help me understand this bioinformatics paper\\\", \\\"what are the weaknesses here?\\\", or \\\"turn this into a mind map\\\". Also activate for any downstream deliverables such as journal club kits, comparison tables, PI decision briefs, replication starters, or follow-up experiment designs. Do NOT treat as a generic summarizer — this skill performs structured evidence-type classification, track-specific critical appraisal, interpretation-boundary judgment, and research-grade follow-up generation."
64
license: MIT
5+
author: AIPOCH
76
---
87
> **Source**: [https://github.com/aipoch/medical-research-skills](https://github.com/aipoch/medical-research-skills)
98
@@ -234,3 +233,5 @@ This skill is designed to connect with other skills in a research workflow:
234233
Close every Standard and Expert report with a brief offer of relevant next steps, for example:
235234

236235
> I can also generate a same-type study comparison table, turn this paper into a journal club kit, design follow-up experiments based on the weakest link, or build a replication starter for the computational section. Just let me know.
236+
237+

scientific-skills/Academic Writing/paper-2-web/SKILL.md

Lines changed: 3 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
---
2-
name: paper-2-web
3-
description: Use this skill when converting academic papers to promotional and presentation formats, including interactive websites (Paper2Web), presentation videos (Paper2Video), and conference posters (Paper2Poster). This skill is suitable for paper dissemination, conference preparation, creating explorable academic homepages, generating video abstracts, or producing printable posters from LaTeX or PDF source.
2+
name: paper-web
3+
description: "Use this skill when converting academic papers to promotional and presentation formats, including interactive websites (Paper2Web), presentation videos (Paper2Video), and conference posters (Paper2Poster). This skill is suitable for paper dissemination, conference preparation, creating explorable academic homepages, generating video abstracts, or producing printable posters from LaTeX or PDF source."
44
license: MIT
55
author: AIPOCH
66
---
@@ -597,3 +597,4 @@ Result file: paper_2_web_result.md
597597
Validation summary: PASS/FAIL with brief notes
598598
Assumptions: explicit list if any
599599
```
600+

scientific-skills/Data Analysis/3d-molecule-ray-tracer/SKILL.md

Lines changed: 4 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
---
2-
name: 3d-molecule-ray-tracer
3-
description: Generate photorealistic rendering scripts for PyMOL and UCSF ChimeraX.
2+
name: d-molecule-ray-tracer
3+
description: "Generate photorealistic rendering scripts for PyMOL and UCSF ChimeraX."
44
license: MIT
55
author: AIPOCH
66
---
@@ -253,7 +253,7 @@ To render:
253253
- **Current Stage**: Draft
254254
- **Next Review Date**: 2026-03-15
255255
- **Known Issues**: None
256-
- **Planned Improvements**:
256+
- **Planned Improvements**:
257257
- Blender integration
258258
- AI-assisted composition suggestions
259259
- Real-time preview mode
@@ -318,3 +318,4 @@ Use the following fixed structure for non-trivial requests:
318318
7. Next Checks
319319

320320
If the request is simple, you may compress the structure, but still keep assumptions and limits explicit when they affect correctness.
321+

scientific-skills/Data Analysis/diagnostic-study-quality-assessment-quadas-2/SKILL.md

Lines changed: 3 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
---
2-
name: diagnostic-study-quality-assessment-quadas-2
3-
description: Analyzes clinical diagnostic accuracy studies for bias using the QUADAS-2 tool. Use when Claude needs to assess the quality, risk of bias, or applicability of diagnostic accuracy studies (e.g., "Assess this paper using QUADAS-2").
2+
name: diagnostic-study-quality-assessment-quadas
3+
description: "Analyzes clinical diagnostic accuracy studies for bias using the QUADAS-2 tool. Use when Claude needs to assess the quality, risk of bias, or applicability of diagnostic accuracy studies (e.g., \"Assess this paper using QUADAS-2\")."
44
license: MIT
55
author: AIPOCH
66
---
@@ -123,3 +123,4 @@ The script will automatically extract the text, which you can then copy and send
123123
## References
124124

125125
- [QUADAS-2 Criteria](references/quadas_2_criteria.md): Detailed signaling questions and judgment guidelines.
126+

scientific-skills/Data Analysis/meta-rob2-plot/SKILL.md

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
---
2-
name: meta-rob2-plot
2+
name: meta-rob-plot
33
description: "Draw ROB2 risk-of-bias plots, including a Traffic Light Plot and a Summary Bar Plot. Input is a CSV file with ROB2 assessments for each study; output are two PNG plot files."
44
license: MIT
55
author: AIPOCH
@@ -282,3 +282,4 @@ Assumptions: explicit list if any
282282
- Confirm the supported execution path completed without unresolved errors.
283283
- Confirm the final deliverable matches the documented format exactly.
284284
- Confirm assumptions, limitations, and warnings are surfaced explicitly.
285+

scientific-skills/Data Analysis/neurokit2/SKILL.md

Lines changed: 5 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
---
2-
name: neurokit2
3-
description: Comprehensive biosignal processing for ECG/PPG/EEG/EDA/RSP/EMG/EOG; use when you need to clean, segment, and extract physiological features for HRV, event-related responses, complexity metrics, or multimodal psychophysiology pipelines.
2+
name: neurokit
3+
description: "Comprehensive biosignal processing for ECG/PPG/EEG/EDA/RSP/EMG/EOG; use when you need to clean, segment, and extract physiological features for HRV, event-related responses, complexity metrics, or multimodal psychophysiology pipelines."
44
license: MIT
55
author: AIPOCH
66
---
@@ -117,9 +117,9 @@ print("Grand average shape:", grand_average.shape)
117117
### Processing pipelines (typical pattern)
118118
Most modalities follow a consistent structure:
119119

120-
1. `*_process(signal, sampling_rate=...)`
120+
1. `*_process(signal, sampling_rate=...)`
121121
Produces a cleaned signal plus intermediate channels (e.g., peaks, phases) and an `info` dict with indices/metadata.
122-
2. `*_analyze(processed_signals, sampling_rate=...)`
122+
2. `*_analyze(processed_signals, sampling_rate=...)`
123123
Computes summary features and automatically selects an analysis mode based on recording length.
124124

125125
Examples:
@@ -172,4 +172,4 @@ Example:
172172
indices = nk.complexity(x, sampling_rate=1000)
173173
apen = nk.entropy_approximate(x)
174174
dfa = nk.fractal_dfa(x)
175-
```
175+
```

scientific-skills/Data Analysis/protocol-deviation-classifier/SKILL.md

Lines changed: 3 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
---
22
name: protocol-deviation-classifier
3-
description: Determine whether an incident in a clinical trial is a "major deviation.
3+
description: "Determine whether an incident in a clinical trial is a \"major deviation."
44
license: MIT
55
author: AIPOCH
66
---
@@ -319,7 +319,7 @@ pip install -r requirements.txt
319319
- **Current Stage**: Draft
320320
- **Next Review Date**: 2026-03-06
321321
- **Known Issues**: None
322-
- **Planned Improvements**:
322+
- **Planned Improvements**:
323323
- Performance optimization
324324
- Additional feature support
325325

@@ -381,3 +381,4 @@ If the request is simple, you may compress the structure, but still keep assumpt
381381
- Do not fabricate results, metrics, citations, or downstream conclusions.
382382
- Use safe fallback behavior when dependencies, credentials, or required inputs are missing.
383383
- Surface any execution failure with a concise diagnosis and recovery path.
384+

0 commit comments

Comments
 (0)