confidence-calibration

xingbpshen / prompt4trust

[ICCV 2025 CVAMD] The official implementation of the paper "Prompt4Trust: A Reinforcement Learning Prompt Augmentation Framework for Clinically-Aligned Confidence Calibration in Multimodal Large Language Models".

machine-learning reinforcement-learning pytorch medical-imaging uncertainty-quantification confidence-calibration trustworthy-ai large-language-models prompt-engineering multimodal-large-language-models

Updated Dec 11, 2025
Python

HKUST-KnowComp / MarConf

[ACL 2025] Revisiting Epistemic Markers in Confidence Estimation: Can Markers Accurately Reflect Large Language Models' Uncertainty?

uncertainty-estimation confidence-estimation epistemic-uncertainty confidence-calibration

Updated Apr 14, 2026
Python

xingbpshen / nested-diffusion

[IEEE Trans. Med. Imaging] The official implementation of the paper "Improving Robustness and Reliability in Medical Image Classification with Latent-Guided Diffusion and Nested-Ensembles".

machine-learning pytorch medical-imaging ensemble-learning uncertainty-quantification diffusion-models confidence-calibration trustworthy-ai

Updated Dec 11, 2025
Python

martinferianc / noise

Investigation of how noise perturbations impact neural network calibration and generalisation

machine-learning neural-network noise generalisation confidence-calibration

Updated Mar 26, 2024
Shell

EFS-OpenSource / Thetis

Service to examine data processing pipelines (e.g., machine learning or deep learning pipelines) for uncertainty consistency (calibration), fairness, and other safety-relevant aspects.

machine-learning validation ai deep-learning dataset neural-networks traceability fairness data-quality robustness fairness-ai explainability fairness-ml uncertainty-calibration confidence-calibration robustness-ml robustness-ai

Updated Mar 13, 2026
Python

xingbpshen / medical-calibration-fairness-mllm

[MICCAI 2025] The official implementation of the paper "Exposing and Mitigating Calibration Biases and Demographic Unfairness in MLLM Few-Shot In-Context Learning for Medical Image Classification".

machine-learning pytorch medical-imaging uncertainty-quantification fairness-ml responsible-ai confidence-calibration trustworthy-ai large-language-models multimodal-large-language-models

Updated Dec 11, 2025
Python

sleep3r / garrus

Python framework for high-quality confidence estimation of deep neural networks, providing methods such as confidence calibration and ordinal ranking.

python deep-neural-networks deep-learning pytorch python-framework confidence-estimation mahine-learning confidence-calibration confidence-ranking

Updated Dec 25, 2024
Python

aperry938 / movecalibrate

Adaptive movement rehabilitation with confidence calibration: a novel Movement Calibration Gap metric that combines real-time pose estimation with metacognitive self-assessment.

react typescript spaced-repetition biomechanics human-computer-interaction pose-estimation rehabilitation privacy-preserving adaptive-systems tensorflow-js mediapipe movement-analysis kinesiology confidence-calibration

Updated Mar 22, 2026
TypeScript

lahavdabah / TS4CP

Code for enhancing Conformal Prediction using Temperature Scaling. Explore more of our work at:

classification uncertainty-quantification icml conformal-prediction temperature-scaling confidence-calibration icml-2025

Updated Jun 13, 2025
Python

Snehgabani / elite-reasoning-mcp

73-tool MCP server that makes any LLM think harder, reason better, and never repeat mistakes. Works with Cursor, Antigravity, VS Code, and any MCP-compatible IDE.

Updated Jun 16, 2026
Python

camerontjs-dot / career-decision-engine

Dependency-free decision-support tool for comparing job offers and career paths with relative scoring, rule checks, calibrated uncertainty, and validation sweeps.

javascript browser validation uncertainty no-dependencies decision-support confidence-calibration

Updated May 6, 2026
JavaScript

abisliouk / HS-MATH-LLM

Evaluate high school math reasoning in LLMs with baseline and Chain-of-Thought (CoT) prompts. Includes confidence calibration metrics, JSON output parsing, and reliability analysis.

openai gpt json-parsing model-evaluation interpretability reliability-analysis confidence-calibration llm prompt-engineering chain-of-thought-reasoning safe-ai

Updated May 29, 2025
Python

Hkd225 / Uncertainly-Engine

Uncertainly Engine - AI framework for uncertainty estimation, confidence scoring, hallucination detection, and reliable LLM outputs.

machine-learning ai decision-making technology artificial-intelligence bayesian uncertainty-estimation ai-agents rag uncertainty-estimations confidence-calibration llm generative-ai hallucination-detection

Updated Jun 20, 2026
Python

kent-tokyo / masstrust

Calibrated trust and abstention for MS/MS molecular annotations

rust cli metabolomics mass-spectrometry msms abstention confidence-calibration selective-prediction risk-coverage

Updated Jun 27, 2026
Rust

Anbu-00001 / deep-trust

Multimodal deepfake detection with explainable AI, robustness validation, and calibrated trust scoring for real-world media.

react computer-vision serverless audio-analysis deepfake-detection confidence-calibration multimodal-ai robustness-testing media-authentication

Updated Mar 15, 2026
TypeScript

ThePharmer / fade

FADE: AI that deliberately forgets as humans do, using memory degradation as an intrinsic confidence signal. Aims to reduce hallucinations, enable epistemic humility, and address stateful deployment. Conceptual proposal seeking implementation and validation.

machine-learning language-models research-proposal rag ai-alignment confidence-calibration retrieval-augmented-generation memory-architecture epistemic-humility

Updated Nov 25, 2025
Python

Here are 33 public repositories matching this topic...

JIA-Lab-research / MiSLAS

Mr-Loevan / VL-Calibration

Impression2805 / FMFP

tor4z / awesome-confidence-calibration

xingbpshen / prompt4trust
