mcq-framework

This repository began in March 2026 as a research project measuring positional bias in LLM multiple-choice answering and testing whether prompting interventions reduce that bias.

It has since been split into two standalone paper repos:

cotenthusiast/two-stage-prompting-paper — sourced from the two-stage-prompting branch
cotenthusiast/model-generalization-paper — sourced from the model-generalization branch

This repo is now being repurposed as mcq-framework, a reusable MCQ evaluation framework for LLM robustness research. It is currently early stage and not yet ready for use.

Existing branches are preserved as historical record.

For current research work, see the two paper repos linked above.

Name		Name	Last commit message	Last commit date
Latest commit History 43 Commits
config		config
data		data
prompts/v1		prompts/v1
runs		runs
scripts		scripts
src/twoprompt		src/twoprompt
tests		tests
.env.example		.env.example
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
README.md		README.md
main.tex		main.tex
pyproject.toml		pyproject.toml
pytest.ini		pytest.ini

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

mcq-framework

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

mcq-framework

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages