Microsoft Research Blog

BenchmarkQED: Automated benchmarking of RAG systems

BenchmarkQED is an open-source toolkit for benchmarking RAG systems through automated query generation, evaluation, and dataset preparation. Benchmarks run with it show that LazyGraphRAG outperforms standard methods, especially on complex, global queries.

Recent Posts

  1. Research Focus: Week of May 7, 2025 

    May 7, 2025

    In this issue: New research on compound AI systems and causal verification of the Confidential Consortium Framework; release of Phi-4-reasoning; enriching tabular data with semantic structure, and more.

  2. Research Focus: Week of April 21, 2025 

    April 23, 2025

    In this issue: our CHI 2025 & ICLR 2025 contributions, plus research on causal reasoning & LLMs; countering LLM jailbreak attacks; and how people use AI vs. AI-alone. Also, SVP of Microsoft Health Jim Weinstein talks rural healthcare innovation.

  3. Research Focus: Week of April 7, 2025 

    April 9, 2025

    In this issue: We introduce a new dataset designed to assist renewable energy infrastructure planners, a new method for denoising MRI imagery, and an AI tool for analyzing distant galaxies. Check out our latest research and other updates. 

Explore More

Events & conferences 

Meet our community of researchers, learn about exciting research topics, and grow your network

Podcasts 

Ongoing conversations at the cutting edge of research

Microsoft Research Forum 

Join us for a continuous exchange of ideas about research in the era of general AI