Athena: Refactor LLM Configuration to YAML-Based System #92


Open. Wants to merge 59 commits into base: main

Conversation

@LeonWehrhahn commented Apr 13, 2025

Motivation and Context

This PR rewrites the llm_core module configuration system to address current limitations. The core motivation behind these changes is threefold:

  1. Granular LLM Model Selection for Tasks: We need the ability to specify different LLM models for different tasks. For example, using a high-powered, but potentially more costly, LLM model for low-volume complex operations like generating initial structured grading instructions, while employing a faster, more economical LLM model for high-volume tasks like generating feedback on individual student submissions.
  2. Flexible and Comprehensive LLM Model Configuration: We need the ability to configure not only the LLM model to use but also its inherent capabilities (e.g., whether it supports function calling or structured output) and default settings (e.g., temperature, top_p). This is crucial for supporting a diverse range of LLM models.
  3. Preserved Dynamic Configuration Overrides via Headers: While not a new feature, we want to retain the existing ability to dynamically override LLM model configurations via x- headers in API requests, as used in the Athena playground.

Description

To achieve the outlined goals, we introduced two YAML files to manage model configurations and capabilities:

  • llm_capabilities.yml (llm_core): This file defines the core capabilities of different LLM models. It specifies default settings (like temperature, top_p) and flags for supported features (like supports_function_calling, supports_structured_output). Importantly, it also allows for LLM model-specific overrides to these defaults. This file resides at the top level of the llm_core directory and is therefore shared by every module (e.g., module_modeling_llm, module_programming_llm).

  • llm_config.yml (module-specific): Each module (e.g., module_modeling_llm, module_programming_llm) now has its own llm_config.yml located at the root level of the module. This file specifies the concrete models to be used for different tasks within that module. For example, the modeling module might specify a powerful model like openai_o1 for generating grading instructions and a faster, more economical model like openai_4o for generating feedback. Switching from environment variables to module-level YAML files for LLM configuration brings these settings under version control, ensuring consistent deployments and eliminating the risk of environment-specific discrepancies.

Many other aspects of the llm_core module were changed to support this new YAML-based configuration approach. These changes are outlined in more detail in the README.
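As a rough illustration, the two files might look like the sketch below. The exact schema is defined in the README; beyond the fields and model names mentioned above (temperature, top_p, supports_function_calling, supports_structured_output, openai_o1, openai_4o), all key names here are illustrative assumptions, not the actual schema.

```yaml
# llm_capabilities.yml (llm_core) -- hypothetical sketch
defaults:
  temperature: 0.0
  top_p: 1.0
  supports_function_calling: true
  supports_structured_output: true
models:
  openai_o1:
    # model-specific overrides of the defaults
    supports_function_calling: false
    temperature: 1.0
---
# llm_config.yml (e.g., in module_modeling_llm) -- hypothetical sketch
models:
  grading_instructions: openai_o1   # low-volume, complex task
  feedback_generation: openai_4o    # high-volume, economical task
```

Because the capabilities file is shared while each llm_config.yml is module-local, a module only has to name the models it uses per task; defaults and feature flags come from llm_core.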

Steps for Testing

  1. Verify Model Configuration:

    • Ensure that the llm_config.yml and llm_capabilities.yml files are correctly parsed.
    • Check that different modules (e.g., module_modeling_llm) can successfully load and use their specified model configurations.
  2. Test Feedback Generation:

    • Verify that feedback generation still works in each module (e.g., module_modeling_llm, module_programming_llm).
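For step 1, a minimal parse-and-sanity check could look like the sketch below. It assumes the parsed files are plain dicts (e.g., from yaml.safe_load) with a top-level models mapping; those key names are assumptions for illustration, not Athena's actual schema.

```python
# Minimal sanity check for a parsed llm_config.yml / llm_capabilities.yml pair.
# Key names ("models", task names) are illustrative assumptions.

def validate_config(config: dict, capabilities: dict) -> list[str]:
    """Return a list of problems; an empty list means the configs look consistent."""
    problems = []
    # Models declared in llm_capabilities.yml
    known_models = set(capabilities.get("models", {}))
    # Every task in llm_config.yml must reference a known model
    for task, model in config.get("models", {}).items():
        if model not in known_models:
            problems.append(f"task {task!r} references unknown model {model!r}")
    return problems

capabilities = {"models": {"openai_o1": {}, "openai_4o": {}}}
config = {"models": {"grading_instructions": "openai_o1",
                     "feedback_generation": "openai_4o"}}
print(validate_config(config, capabilities))  # prints: []
```

A check like this can run at module startup so a typo in a model name fails fast instead of surfacing mid-request.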

Summary by CodeRabbit

  • New Features

    • Introduced dynamic, provider-agnostic configuration and loading of language models (OpenAI, Azure, Ollama) via new YAML config files and modular provider support.
    • Added support for specifying multiple model roles (base, mini, fast reasoning, long reasoning) per module.
  • Improvements

    • Enhanced modularity and flexibility in model selection for all LLM-based modules.
    • Refactored model configuration and prompt handling for improved clarity, maintainability, and extensibility.
    • Simplified prompt construction and prediction calls by removing explicit output formatting and function calling flags.
    • Updated environment variable samples and documentation for easier integration with new providers.
    • Improved concurrency and error handling in feedback generation workflows.
    • Added dynamic argument handling to suggestion generation based on method signatures.
    • Added checks for null prediction results to prevent downstream errors.
  • Bug Fixes

    • Improved error handling and logging for model discovery and prediction processes.
  • Documentation

    • Added comprehensive README and configuration file documentation for LLM integration.
  • Style

    • Reformatted code and configuration files for consistency and readability.
    • Reformatted logging calls and function signatures for clarity.
  • Refactor

    • Replaced static model configuration with dynamic, runtime-loaded configurations.
    • Updated function and method signatures for clarity and explicitness.
    • Removed deprecated or redundant fields and logic related to output formatting.
    • Consolidated and simplified model configuration imports and exports.
    • Removed obsolete model integration modules and replaced with provider-specific configs.
    • Simplified prompt utilities by removing conditional formatting logic.
    • Replaced strategy factory pattern with direct approach implementation registry and dynamic dispatch.
  • Chores

    • Updated dependencies (e.g., OpenAI package version).
    • Added and updated example environment and configuration files.
    • Added Poetry virtual environment configuration files for multiple modules.
    • Updated VSCode workspace settings for consistent Python interpreter paths.
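The "dynamic argument handling based on method signatures" item above can be sketched with Python's standard inspect module; the generator functions below are hypothetical stand-ins, not Athena's actual API.

```python
import inspect

def call_with_supported_args(func, **available):
    """Pass only the keyword arguments that func actually declares."""
    params = inspect.signature(func).parameters
    accepted = {name: value for name, value in available.items() if name in params}
    return func(**accepted)

# Hypothetical suggestion generators with differing signatures:
def generate_basic(submission):
    return f"feedback for {submission}"

def generate_with_model(submission, model):
    return f"feedback for {submission} via {model}"

# The caller can offer a superset of arguments; each generator
# receives only the ones it declares.
print(call_with_supported_args(generate_basic, submission="s1", model="openai_4o"))
print(call_with_supported_args(generate_with_model, submission="s1", model="openai_4o"))
```

This lets the registry dispatch to approach implementations with different signatures without every implementation having to accept every argument. (Note the simple name check would miss functions that accept **kwargs; a fuller version would special-case VAR_KEYWORD parameters.)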

LeonWehrhahn and others added 30 commits November 16, 2024 19:30
…tionships; fix foreign key references and ensure proper inheritance structure.
…remove debug prints, update caching logic, and change serialization method for structured grading instructions
Contributor @ahmetsenturk left a comment

the following things are ok ✅

  • code lgtm,
  • tested locally with Artemis, request AI feedback worked with programming/modeling/text exercises,
  • playground with text

the following are failing ❌

  • programming and modeling on playground:
[Screenshots attached: 2025-06-16 at 16:18:17 and 16:17:08]

P.S. this would still require intensive testing, as it touches almost the whole of Athena :)


⚠️ Unable to deploy to test server ⚠️

"Athena - Test 1" is already in use by PR #172.


There hasn't been any activity on this pull request recently. Therefore, this pull request has been automatically marked as stale and will be closed if no further activity occurs within seven days. Thank you for your contributions.

Labels: athena, lock:athena-test1, stale
Projects
Status: Backlog
4 participants