Extend support for `useFileOutput` to `stream` #309

mattt · 2024-09-23T12:01:18Z

No description provided.

lib/stream.js

aron · 2024-09-23T14:53:48Z

lib/stream.js

+        if (
+          useFileOutput &&
+          typeof data === "string" &&
+          (data.startsWith("https:") || data.startsWith("data:"))


I think we want to explicitly match that data consists of only a valid data uri. There is a reasonable chance that a language model might emit a data: line that starts with "data:" but less so that it will emit a line that consists only of a well formed data-uri.

🤔 That said, I did think today after reading this post on other AI apis that we should move to structured outputs.

Perhaps the file stream should emit JSON.

data: {"type": "url", value: "data://..."}

And in future we refactor the text streaming interface to do the same:

data: {"type": "string", "data: and some more text"}

I think we want to explicitly match that data consists of only a valid data uri. There is a reasonable chance that a language model might emit a data: line that starts with "data:" but less so that it will emit a line that consists only of a well formed data-uri.

That's a good callout. The trick is finding a good way to validate without parsing the whole thing and throwing away the result. I think we can still read lazily if we use a regex to apply some heuristics about its first chunk of content.

🤔 That said, I did think today after reading this post on other AI apis that we should move to structured outputs.

Perhaps the file stream should emit JSON.

data: {"type": "url", value: "data://..."}

And in future we refactor the text streaming interface to do the same:

data: {"type": "string", "data: and some more text"}

I see the advantages of typed outputs, but also quite like the experience we have now of emitting raw tokens. In any case, structured outputs would be a backwards-incompatible change, so we'd have to be clever about a migration.

One way we could support both would be keeping Accept: text/event-stream as-is, but adding support for Accept: text/event-stream+json. The client libraries could start sending that as needed to opt into structured outputs.

mattt added 4 commits September 23, 2024 04:56

Break stream on done event without enqueuing it

254b7df

Fix doc comment for fetch parameter

4430e6b

Transform URLs in streaming responses when useFileOutput is enabled

139fb89

Revert 'Break stream on done event without enqueuing it'

6a2156d

mattt commented Sep 23, 2024

View reviewed changes

lib/stream.js Outdated Show resolved Hide resolved

mattt requested a review from aron September 23, 2024 13:09

aron reviewed Sep 23, 2024

View reviewed changes

Default to useFileOutput = true for readable streams

51d300b

mattt merged commit 7f02a64 into main Sep 25, 2024
19 checks passed

mattt deleted the mattt/stream-file branch September 25, 2024 18:29

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Extend support for `useFileOutput` to `stream` #309

Extend support for `useFileOutput` to `stream` #309

mattt commented Sep 23, 2024

aron Sep 23, 2024

mattt Sep 25, 2024

Extend support for useFileOutput to stream #309

Extend support for useFileOutput to stream #309

Conversation

mattt commented Sep 23, 2024

aron Sep 23, 2024

Choose a reason for hiding this comment

mattt Sep 25, 2024

Choose a reason for hiding this comment

Extend support for `useFileOutput` to `stream` #309

Extend support for `useFileOutput` to `stream` #309