Mechanistic Analysis of Alignment Algorithms in Language Models Paper • 2606.09850 • Published May 9 • 2
Evaluation Cards: An Interpretive Layer for AI Evaluation Reporting Paper • 2606.09809 • Published 5 days ago • 4
$\mathrm{ECI}_{\mathrm{sem}}$: Semantic Residual Effective Contrastive Information for Evaluating Hard Negatives Paper • 2603.20990 • Published 8 days ago • 1
BiCA: Effective Biomedical Dense Retrieval with Citation-Aware Hard Negatives Paper • 2511.08029 • Published Nov 11, 2025 • 4
ViBe: A Text-to-Video Benchmark for Evaluating Hallucination in Large Multimodal Models Paper • 2411.10867 • Published Nov 16, 2024 • 9