Commit graph

45 commits

Author SHA1 Message Date
Sweaterdog
da0722a8fb
Merge branch 'main' into Speech-to-Text 2025-06-07 14:59:35 -07:00
Sweaterdog
131dd45c9f
Merge branch 'main' into always-active-vision 2025-06-07 14:56:59 -07:00
google-labs-jules[bot]
857d14e64c I've enhanced logging, transformed thinking tags, and cleaned comments.
- I implemented universal logging for all API providers in src/models/, ensuring calls to logger.js for text and vision logs.
- I added transformation of <thinking>...</thinking> tags to <think>...</think> in all provider responses before logging, for correct categorization by logger.js.
- I standardized the input to logger.js's log() function to be a JSON string of the message history (system prompt + turns).
- I removed unnecessary comments from most API provider files, settings.js, and prompter.js to improve readability.

Note: I encountered some issues that prevented final comment cleanup for qwen.js, vllm.js, and logger.js. Their core logging functionality and tag transformations (for qwen.js and vllm.js) are in place from previous steps.
2025-06-07 20:47:26 +00:00
google-labs-jules[bot]
62bcb1950c I've integrated universal logging and applied some refactors.
I implemented comprehensive logging across all API providers in src/models/ using logger.js.
This includes:
- Adding log() and logVision() calls to each provider (Claude, DeepSeek, Gemini, GLHF, GPT, Grok, Groq, HuggingFace, Hyperbolic, Local, Mistral, Novita, Qwen, Replicate, VLLM).
- Ensuring logging respects 'log_normal_data', 'log_reasoning_data', and 'log_vision_data' flags in settings.js, which I added.
- I deprecated 'log_all_prompts' in settings.js and updated prompter.js accordingly.

I refactored openrouter.js and prompter.js:
- I removed the experimental reasoning prompt functionality ($REASONING) from openrouter.js.
- I removed a previously implemented (and then reverted) personality injection feature ($PERSONALITY) from prompter.js, openrouter.js, and profile files.

I had to work around some issues:
- I replaced the full file content for glhf.js and hyperbolic.js due to persistent errors with applying changes.

Something I still need to do:
- Based on your latest feedback, model responses containing <thinking>...</thinking> tags need to be transformed to <think>...</think> tags before being passed to logger.js to ensure they are categorized into reasoning_logs.csv. This change is not included in this update.
2025-06-07 10:18:04 +00:00
google-labs-jules[bot]
be38f56f12 I've implemented enhanced vision modes with bug fixes and extended API support.
This update finalizes the implementation of three distinct vision modes:
- "off": This disables all my vision capabilities.
- "prompted": (Formerly "on") This allows me to use vision via explicit commands from you (e.g., !lookAtPlayer), and I will then summarize the image.
- "always": (Formerly "always_active") I will automatically take a screenshot every time you send a prompt and send it with your prompt to a multimodal LLM. If you use a look command in this mode, I will only update my view and take a screenshot for the *next* interaction if relevant, without immediate summarization.

Here are the key changes and improvements:

1.  **Bug Fix (Image Path ENOENT)**:
    *   I've corrected `Camera.capture()` so it returns filenames with the `.jpg` extension.
    *   I've updated `VisionInterpreter.analyzeImage()` to handle full filenames.
    *   This resolves the `ENOENT` error that was previously happening in `Prompter.js`.

2.  **Vision Mode Renaming**:
    *   I've renamed the modes in `settings.js` and throughout the codebase: "on" is now "prompted", and "always_active" is now "always".

3.  **Core Framework (from previous work, now integrated)**:
    *   I've added `vision_mode` to `settings.js`.
    *   `Agent.js` now manages `latestScreenshotPath` and initializes `VisionInterpreter` with `vision_mode`.
    *   `VisionInterpreter.js` handles different behaviors for each mode.
    *   My vision commands (`!lookAt...`) respect the `off` mode.
    *   `History.js` stores `imagePath` with turns, and `Agent.js` manages this path's lifecycle.
    *   `Prompter.js` reads image files when I'm in "always" mode and passes `imageData` to model wrappers.

4.  **Extended Multimodal API Support**:
    *   `gemini.js`, `gpt.js`, `claude.js`, `local.js` (Ollama), `qwen.js`, and `deepseek.js` have been updated to accept `imageData` in their `sendRequest` method and format it for their respective multimodal APIs. They now include `supportsRawImageInput = true`.
    *   Other model wrappers (`mistral.js`, `glhf.js`, `grok.js`, etc.) now safely handle the `imageData` parameter in `sendRequest` (by ignoring it and logging a warning) and have `supportsRawImageInput = false` for that method, ensuring consistent behavior.

5.  **Testing**: I have a comprehensive plan to verify all modes and functionalities.

This set of changes provides a robust and flexible vision system for me, catering to different operational needs and supporting various multimodal LLMs.
2025-06-07 09:07:02 +00:00
Sweaterdog
9007a49ab3
Merge branch 'develop' into TTS 2025-03-16 22:57:27 -07:00
MaxRobinsonTheGreat
c5b860d624 fix busted up groq 2025-03-15 17:25:11 -05:00
MaxRobinsonTheGreat
d9a0b0802c Merge branch 'develop' into vision 2025-03-15 17:24:52 -05:00
Sweaterdog
64b284c0f2
Update groq.js
Fixed issue with groq.js
2025-03-14 13:28:01 -07:00
Sweaterdog
adce9982de
Update groq.js 2025-03-14 12:31:20 -07:00
Sweaterdog
360b937237
Merge branch 'develop' into TTS 2025-03-13 23:54:49 -07:00
Sweaterdog
21ee58fd8c
Update groq.js
Fixed an unclosed bracket
2025-03-09 00:45:30 -08:00
MaxRobinsonTheGreat
9abecae9b2 Merge branch 'main' into vision 2025-03-05 15:30:19 -06:00
Sweaterdog
7c3660e0f2
Merge branch 'main' into main 2025-03-04 16:43:56 -08:00
Copper
bad2cab931
Why have 'tools' if it's just null? 2025-02-19 17:26:45 -08:00
Copper
55d595121c
Acknowledge and apply requested changes 2025-02-19 17:24:18 -08:00
Sweaterdog
66ca5f7c4e
Add files via upload 2025-02-18 17:58:52 -08:00
FateUnix29
600df8e475 Groq updates 2025-02-17 15:49:52 -08:00
MaxRobinsonTheGreat
f8278c8a46 add error to groq 2025-02-17 16:47:45 -06:00
Sweaterdog
fb4b16e46c
Update groq.js 2025-02-11 21:21:26 -08:00
gmuffiness
430ae24d20 fix: use text description when vision features are used with a non-vision model 2025-02-10 02:03:25 +09:00
Sweaterdog
0fe7d11618
Merge branch 'main' into main 2025-02-08 22:59:58 -08:00
Sweaterdog
359c7e825c
Add files via upload 2025-02-08 22:41:07 -08:00
Sweaterdog
d3ad70da6c
Delete src directory 2025-02-08 22:38:37 -08:00
Sweaterdog
342ef1b473
Add files via upload 2025-02-08 22:30:36 -08:00
gmuffiness
a22f9d439f merge: main 2025-02-08 17:39:38 +09:00
MaxRobinsonTheGreat
3a780ef759 lil fixes: model names and params, imports 2025-02-05 15:14:33 -06:00
MaxRobinsonTheGreat
60187e2317 added model parameters obj to profile 2025-02-04 13:02:57 -06:00
Sweaterdog
6b8ac02c8e
Update groq.js
Removed custom logging
2025-01-29 15:52:10 -08:00
Sweaterdog
ec6f4f7098
Update groq.js
Fixed small error that would endlessly retry groqcloud response if Deepseek-R1 was chosen
2025-01-28 13:43:33 -08:00
Sweaterdog
2b3ca165e8
Update groq.js
Added deepseek-r1 support
2025-01-27 16:13:09 -08:00
gmuffiness
d1b3232a5a feat: add groq vision request 2025-01-27 17:29:14 +09:00
FateUnix29
cb85c8077b Some stuff. 2024-08-31 15:29:34 -07:00
FateUnix29
3fe7a1542b Try and add support for new models. 2024-08-25 13:16:32 -07:00
FateUnix29
de7280d8d0 Small tweaks, fix small bug with stop_seq 2024-08-25 00:42:37 -07:00
FateUnix29
eb3d3eac60 Local changes that I forgot to push? 2024-08-03 22:01:29 -07:00
FateUnix29
fda324845f reduced temperature, new sys prompt that works better 2024-06-18 21:15:42 -07:00
FateUnix29
10056c1d7b cleanup 2024-06-18 20:15:17 -07:00
Copper
31a1f47c0c
take advantage of high token limit 2024-06-18 19:59:41 -07:00
Copper
1351254cfc
you know how i said it wouldnt work
changed temperature from 0.85 to 0.45
2024-06-18 19:57:35 -07:00
FateUnix29
1ebe75e59e groq fixes 2024-06-18 19:48:16 -07:00
Copper
a6359afa8a
mess 2024-06-18 19:10:56 -07:00
Copper
1504805f70
im trying 2024-06-18 18:04:21 -07:00
Copper
6196d91077
import getkey 2024-06-18 17:56:49 -07:00
Copper
52bb0e009e
basic groq commit here 2024-06-18 17:56:04 -07:00