Skip to content
@rungalileo

Galileo

Evaluate, observe, and protect your GenAI applications

Pinned Loading

  1. agent-leaderboard agent-leaderboard Public

    Ranking LLMs on agentic tasks

    Jupyter Notebook 140 13

  2. hallucination-index hallucination-index Public

    Initiative to evaluate and rank the most popular LLMs across common task types based on their propensity to hallucinate.

    110 7

  3. sdk-examples sdk-examples Public

    Examples on how to get started with the Galileo SDKs for AI Evaluation and Observability (both in Python and Typescript)

    Python 5

Repositories

Showing 10 of 37 repositories

Top languages

Loading…

Most used topics

Loading…