This is the custom benchmark framework, built from the lm-evaluation-harness, for our lab's LLM-related projects.
- Follow the installation instructions below to ensure compatibility:
  ```bash
  conda create --name bids_lm_eval python=3.12
  conda activate bids_lm_eval
  git clone --depth 1 git@github.com:BIDS-Xu-Lab/bids-lm-evaluation.git
  cd bids-lm-evaluation
  pip install uv
  uv pip install -e .
  uv pip install "lm_eval[vllm]" "vllm==0.8.5"  # pinned for MoE model compatibility; does not support gpt-oss quantization
  ```
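
  A quick way to check that the environment works is to run a small evaluation with the vLLM backend. This is only a sketch: the model checkpoint and task name below are placeholders, so substitute whatever you actually want to evaluate.

  ```bash
  # Smoke test: evaluate a (placeholder) model on a single existing task.
  # Requires a GPU visible to vLLM; set tensor_parallel_size to the number of GPUs.
  lm_eval --model vllm \
      --model_args pretrained=meta-llama/Llama-3.1-8B-Instruct,tensor_parallel_size=1,gpu_memory_utilization=0.85 \
      --tasks gsm8k \
      --batch_size auto
  ```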
- To use existing tasks, check out the `lm_eval/tasks` folder.
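
  To see which task names are already registered (both the upstream ones and ours), the harness can list them from the command line:

  ```bash
  # Print all task names the harness currently knows about
  lm_eval --tasks list
  ```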
- Add new tasks to the `bids_tasks` folder and make sure to add `include_path` for this folder when testing with the `lm_eval` command (see the example below).
- Check out the `jobs_run` folder to find the scripts we run on specific YCRC HPC clusters (a sketch of such a job script is shown below).
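
After adding a task config under `bids_tasks`, the folder has to be registered at run time via `--include_path`. A minimal sketch, assuming a hypothetical task named `my_bids_task` defined in that folder and the same placeholder model as above:

```bash
# Register the custom task folder and run one of its tasks.
# `my_bids_task` is a placeholder for whatever task name your YAML defines.
lm_eval --model vllm \
    --model_args pretrained=meta-llama/Llama-3.1-8B-Instruct \
    --tasks my_bids_task \
    --include_path bids_tasks \
    --batch_size auto
```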
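
The scripts in `jobs_run` are cluster-specific, but a typical submission follows the usual SLURM pattern on YCRC machines. The sketch below is not a copy of any script in the repo; the partition, GPU request, module name, and model/task names are assumptions, so take the real values from the scripts in `jobs_run`.

```bash
#!/bin/bash
#SBATCH --job-name=bids_lm_eval
#SBATCH --partition=gpu          # placeholder partition; check jobs_run for the real one
#SBATCH --gpus=1
#SBATCH --cpus-per-task=8
#SBATCH --mem=64G
#SBATCH --time=06:00:00

# Activate the environment created during installation
module load miniconda            # module name may differ per cluster
conda activate bids_lm_eval

# Placeholder model/task; mirror the commands in jobs_run for real runs
lm_eval --model vllm \
    --model_args pretrained=meta-llama/Llama-3.1-8B-Instruct \
    --tasks my_bids_task \
    --include_path bids_tasks \
    --batch_size auto
```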