
[WIP] Ai24 4 raise context limit #9

Merged · 23 commits into main · Mar 26, 2025

Conversation

andychase (Member)

No description provided.

@andychase (Member Author)

I created this just to review it; Mike said it's not ready yet.

@andychase (Member Author) left a comment

Looks good, Mike! I left some notes to help out John as he reviews.

I wasn't able to trigger the nice warning message just by copy/pasting large amounts of text, though eventually I got the alert popup. When I typed large messages in, the warning did display. Maybe the "every 8 characters" rule is why copy/pasting into the text area didn't trigger it. I do like the visual style and placement of the warning message, and how the conversation turns orange in the conversation tree, though I noticed some odd behavior with that (see below).

Maybe Maureen could think about the wording. It currently says "approx. characters left in conversation context: 81588 ?". Would "Warning: you are approaching the number of words this model is able to handle. Consider starting a new conversation." be better? Just thinking that many people are not tech-savvy. Maybe also a different message for when the initial message is the large one: "This message is too large for this model".

[Screenshot: warning message, 2025-03-17 at 3:14 PM]

Keep it up, Mike!

@@ -84,9 +97,38 @@ export const ChatInput = ({
}

setContent(value);

// only run the token count every 8 characters since it slows down the display of what's typed
@andychase (Member Author):

I wonder what @Scr1ptcat thinks, but consider just using the number of characters instead of calculating tokens. Token counting is more accurate, but given that this is more of a warning, and that the server-side token counting may not match this tiktoken counting library anyway, I'm not sure a 100% precise amount is needed.

As a heuristic, 4 characters per token is commonly used, as you noted elsewhere.

Our GFE is quite slow and bogged down, so even a little bit of lagginess while people are typing would infuriate them, I think.

I also think this would simplify the codebase a little bit.
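
A minimal sketch of what the character-based check could look like (the names and formula here are illustrative, not the PR's actual code), using the ~4 characters-per-token heuristic from this thread:

// Heuristic discussed above: roughly 4 characters per token.
const CHARS_PER_TOKEN = 4;

// Estimate remaining context from string lengths alone: O(1) per keystroke,
// versus re-running the tiktoken encoder over the whole conversation.
function approxCharsLeft(conversationChars: number, tokenLimit: number): number {
  return tokenLimit * CHARS_PER_TOKEN - conversationChars;
}

// Example: against a 128_000-token limit, a 430_412-character conversation
// leaves 81_588 characters, the kind of figure the warning above displays.
console.log(approxCharsLeft(430_412, 128_000)); // 81588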

handleChange(event);
}, [selectedConversation]);

//useEffect(() => {
@andychase (Member Author):

Consider removing this commented-out code.

@@ -6,6 +6,14 @@ import tiktokenModel from '@dqbd/tiktoken/encoders/cl100k_base.json';
import { NextApiRequest, NextApiResponse } from 'next';
import { Tiktoken } from '@dqbd/tiktoken';

export const config = {
@andychase (Member Author):

I'm curious about this change!
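
For context: Next.js API routes use this export to configure route behavior. One plausible reason to add it when raising the context limit is the body parser's size cap; the shape below is my assumption, since the hunk doesn't show the fields actually added:

// Hypothetical sketch; the fields added in this PR aren't visible in the hunk.
export const config = {
  api: {
    bodyParser: {
      sizeLimit: '4mb', // assumed value: accept larger request bodies for long conversations
    },
  },
};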

@@ -18,7 +18,7 @@ export const OpenAIModels: Record<OpenAIModelID, OpenAIModel> = {
   [OpenAIModelID.GPT_4]: {
     id: OpenAIModelID.GPT_4,
     name: 'GPT-4',
-    maxLength: 128_000*3,
+    maxLength: 128_000 * 4,
@andychase (Member Author):

Good catch. Originally these were magic numbers, and the value appeared to be 3× the tokenLimit. Not sure why Mckay (the original author) chose 3× instead of 4× as the character upper bound.
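
One way to drop the magic number entirely would be to derive the character cap from the token limit. A sketch, assuming the model entry keeps its tokenLimit field next to maxLength (the field name comes from the comment above):

const CHARS_PER_TOKEN = 4; // the heuristic noted earlier in this review

[OpenAIModelID.GPT_4]: {
  id: OpenAIModelID.GPT_4,
  name: 'GPT-4',
  tokenLimit: 128_000,
  maxLength: 128_000 * CHARS_PER_TOKEN, // character upper bound, no magic multiplier
},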

"version": "1.0.14",
"resolved": "https://registry.npmjs.org/@dqbd/tiktoken/-/tiktoken-1.0.14.tgz",
"integrity": "sha512-R+Z1cVYOc8ZoDls6T2YhlUYrwKyuZoRJsSK3vN7iWWjBJ1xoX7e5BhUkEh5n6cXuMWQVUTHLlSDpnyv0Ye7xxw=="
"version": "1.0.20",
@andychase (Member Author):

FYI @Scr1ptcat, I did see an error trying this patch, but running npm i to update the dependencies resolved it.

{(isPastCharacterCount || isHighCharacterCount) && (
  <span className="helpCircle" title="Once past the context limit, the conversation will no longer produce responses relevant to content before the limit">&nbsp;&nbsp;?&nbsp;&nbsp;</span>
)}
</div>
@andychase (Member Author):

[Screenshot: help-circle indicator, 2025-03-17 at 3:06 PM]

FYI here's how it appears.

@andychase (Member Author)

> I wasn't able to trigger the nice warning message just by copy/pasting in large amounts of text though eventually I got the alert popup

Testing further, I was able to trigger it now; I just needed to type text to fire the once-every-8-characters logic. Mike and I discussed using the throttling function from lodash to resolve this (see the sketch below).
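
A sketch of the lodash approach (everything except throttle itself is an illustrative stand-in): a throttled check runs at a steady cadence no matter how text arrives, whereas a length-based every-8-characters gate can be skipped entirely by a single large paste.

import throttle from 'lodash/throttle';

// Hypothetical stand-ins for the PR's actual counting and warning state.
const countTokens = (text: string): number => Math.ceil(text.length / 4);
let showWarning = false;

// Run the expensive check at most once every 500 ms while the user types,
// instead of only when the length happens to cross the every-8 gate.
const throttledCheck = throttle((text: string) => {
  showWarning = countTokens(text) > 128_000;
}, 500);

// Wired into the input handler, e.g.: throttledCheck(event.target.value);

In a React component, the throttled function would also need to be kept stable across renders (useMemo or useRef); otherwise each render creates a fresh throttle window.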

@andychase (Member Author)

I just reverted the local-dev and file-upload changes so this would be a clean patch/branch containing only this feature; those reverts can then be un-reverted and tested after this patch lands. They are valuable changes.

@andychase marked this pull request as ready for review March 26, 2025 23:33
@andychase (Member Author)

Looks good to me

@andychase merged commit efea71a into main Mar 26, 2025
2 checks passed