Alternative LLM Predictor Implementation

2025-05-02 13:38:42 +02:00
parent 1ac23eb252
commit 667f119c61
15 changed files with 1454 additions and 100 deletions
--- a/Documentation/01_User_Guide/04_Configuration_and_Presets.md
+++ b/Documentation/01_User_Guide/04_Configuration_and_Presets.md
@@ -13,6 +13,18 @@ The `app_settings.json` file is structured into several key sections, including:
 *   `ASSET_TYPE_DEFINITIONS`: Defines known asset types (like Surface, Model, Decal) and their properties.
 *   `MAP_MERGE_RULES`: Defines how multiple input maps can be merged into a single output map (e.g., combining Normal and Roughness into one).

+### LLM Predictor Settings
+
+For users who wish to utilize the experimental LLM Predictor feature, the following settings are available in `config/app_settings.json`:
+
+*   `llm_endpoint_url`: The URL of the LLM API endpoint. For local LLMs like LM Studio or Ollama, this will typically be `http://localhost:<port>/v1`. Consult your LLM server documentation for the exact endpoint.
+*   `llm_api_key`: The API key required to access the LLM endpoint. Some local LLM servers may not require a key, in which case this can be left empty.
+*   `llm_model_name`: The name of the specific LLM model to use for prediction. This must match a model available at your specified endpoint.
+*   `llm_temperature`: Controls the randomness of the LLM's output. Lower values (e.g., 0.1-0.5) make the output more deterministic and focused, while higher values (e.g., 0.6-1.0) make it more creative and varied. For prediction tasks, lower temperatures are generally recommended.
+*   `llm_request_timeout`: The maximum time (in seconds) to wait for a response from the LLM API. Adjust this based on the performance of your LLM server and the complexity of the requests.
+
+Note that the `llm_predictor_prompt` and `llm_predictor_examples` settings are also present in `app_settings.json`. These define the instructions and examples provided to the LLM for prediction. While they can be viewed here, they are primarily intended for developer reference and tuning the LLM's behavior, and most users will not need to modify them.
+
 ## GUI Configuration Editor

 You can modify the `app_settings.json` file using the built-in GUI editor. Access it via the **Edit** -> **Preferences...** menu.
--- a/Documentation/01_User_Guide/05_Usage_GUI.md
+++ b/Documentation/01_User_Guide/05_Usage_GUI.md
@@ -18,7 +18,7 @@ python -m gui.main_window
    *   **Preset List:** Create, delete, load, edit, and save presets. On startup, the "-- Select a Preset --" item is explicitly selected. You must select a specific preset from this list to load it into the editor below, enable the detailed file preview, and enable the "Start Processing" button.
    *   **Preset Editor Tabs:** Edit the details of the selected preset.
 *   **Processing Panel (Right):**
-    *   **Preset Selector:** Choose the preset to use for *processing* the current queue.
+    *   **Preset Selector:** Choose the preset to use for *processing* the current queue. This dropdown now includes a new option: "- LLM Interpretation -". Selecting this option will use the experimental LLM Predictor instead of the traditional rule-based prediction system defined in presets.
    *   **Output Directory:** Set the output path (defaults to `config/app_settings.json`, use "Browse...")
    *   **Drag and Drop Area:** Add asset `.zip`, `.rar`, `.7z` files, or folders by dragging and dropping them here.
    *   **Preview Table:** Shows queued assets in a hierarchical view (Source -> Asset -> File). Initially, this area displays a message prompting you to select a preset. Once a preset is selected from the Preset List, the detailed file preview will load here. The mode of the preview depends on the "View" menu:
@@ -32,7 +32,8 @@ python -m gui.main_window
        *   `Clear Queue`: Button to clear the queue and preview.
        *   `Start Processing`: Button to start processing the queue. This button is disabled until a valid preset is selected from the Preset List.
        *   `Cancel`: Button to attempt stopping processing.
-*   **Status Bar:** Displays current status, errors, and completion messages.
+        *   **Re-interpret Selected with LLM:** This button appears when the "- LLM Interpretation -" preset is selected. It allows you to re-process only the currently selected items in the Preview Table using the LLM, without affecting other items in the queue. This is useful for refining predictions on specific assets.
+*   **Status Bar:** Displays current status, errors, and completion messages. During LLM processing, the status bar will show messages indicating the progress of the LLM requests.

 ## GUI Configuration Editor