A class that enables calls to the Cloudflare Workers AI API to access large language models in a chat-like fashion. It extends the SimpleChatModel class and implements the CloudflareWorkersAIInput interface.
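
Below is a minimal usage sketch. It assumes this is the ChatCloudflareWorkersAI class from LangChain.js, that it is exported from the @langchain/cloudflare integration package (older releases exported it from "langchain/chat_models/cloudflare_workersai"), and that credentials come from illustrative environment variables; invoke is the standard Runnable entry point on recent releases.

    import { ChatCloudflareWorkersAI } from "@langchain/cloudflare";
    import { HumanMessage } from "@langchain/core/messages";

    // The account id and API token identify the Workers AI deployment;
    // the environment variable names here are illustrative, not required.
    const model = new ChatCloudflareWorkersAI({
      model: "@cf/meta/llama-2-7b-chat-int8", // the default model (see Properties)
      cloudflareAccountId: process.env.CLOUDFLARE_ACCOUNT_ID,
      cloudflareApiToken: process.env.CLOUDFLARE_API_TOKEN,
    });

    // Send a single chat turn and print the assistant's reply.
    const response = await model.invoke([
      new HumanMessage("What is the capital of France?"),
    ]);
    console.log(response.content);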

Hierarchy

  SimpleChatModel
    ↳ ChatCloudflareWorkersAI

Implements

  CloudflareWorkersAIInput

Constructors

Properties

ParsedCallOptions: Omit<BaseLanguageModelCallOptions, never>
baseUrl: string
caller: AsyncCaller

The async caller should be used by subclasses to make any async calls, which will thus benefit from the concurrency and retry logic.

model: string = "@cf/meta/llama-2-7b-chat-int8"
streaming: boolean = false
verbose: boolean

Whether to print out response text.

callbacks?: Callbacks
cloudflareAccountId?: string
cloudflareApiToken?: string
metadata?: Record<string, unknown>
tags?: string[]
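
The optional fields above are typically supplied to the constructor. The sketch below, under the same package assumption as the earlier example, enables streaming and verbose output and then consumes the resulting token stream; the tag and metadata values are purely illustrative.

    import { ChatCloudflareWorkersAI } from "@langchain/cloudflare";

    const chat = new ChatCloudflareWorkersAI({
      cloudflareAccountId: process.env.CLOUDFLARE_ACCOUNT_ID, // illustrative env vars
      cloudflareApiToken: process.env.CLOUDFLARE_API_TOKEN,
      streaming: true, // request a streamed response from the Workers AI API
      verbose: true,   // print response text as it is generated
      tags: ["cloudflare-example"],          // attached to emitted run events
      metadata: { experiment: "docs-demo" }, // likewise attached to run events
    });

    // stream() yields message chunks as they arrive.
    const stream = await chat.stream("Write a haiku about the sea.");
    for await (const chunk of stream) {
      process.stdout.write(chunk.content as string);
    }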

Accessors

Methods

  • streamLog: Streams all output from a runnable, as reported to the callback system. This includes all inner runs of LLMs, retrievers, tools, etc. Output is streamed as RunLogPatch objects, each containing a list of jsonpatch ops that describe how the state of the run has changed at each step, along with the final state of the run. Applying the jsonpatch ops in order reconstructs that state.

    Parameters

    Returns AsyncGenerator<RunLogPatch, any, unknown>
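
A sketch of consuming this method, again under the package and environment-variable assumptions above; the prompt string is illustrative.

    import { ChatCloudflareWorkersAI } from "@langchain/cloudflare";

    const chat = new ChatCloudflareWorkersAI({
      cloudflareAccountId: process.env.CLOUDFLARE_ACCOUNT_ID,
      cloudflareApiToken: process.env.CLOUDFLARE_API_TOKEN,
    });

    // Each RunLogPatch carries a list of jsonpatch ops; applying them in
    // order reconstructs the full state of the run, including inner runs.
    for await (const patch of chat.streamLog("Tell me a joke about llamas.")) {
      console.log(JSON.stringify(patch.ops, null, 2));
    }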
