β™ŠοΈ GemiNews πŸ—žοΈ (dev)

Demo 1: Embeddings + Recommendation Demo 2: Bella RAGa Demo 3: NewRetriever Demo 4: Assistant function calling

πŸ—žοΈStreaming LLM Responses

πŸ—ΏSemantically Similar Articles (by :title_embedding)

Streaming LLM Responses

2024-03-03 - Dave Kimura (from Drifitng ruby)

In this episode, we look at running a self hosted Large Language Model (LLM) and consuming it with a Rails application. We will use a background to make API requests to the LLM and then stream the responses in real-time to the browser.

[Technology] 🌎 https://www.driftingruby.com/episodes/streaming-llm-responses [🧠] [v2] article_embedding_description: {:llm_project_id=>"Unavailable", :llm_dimensions=>nil, :article_size=>412, :llm_embeddings_model_name=>"textembedding-gecko"}
[🧠] [v1/3] title_embedding_description: {:ricc_notes=>"[embed-v3] Fixed on 9oct24. Only seems incompatible at first glance with embed v1.", :llm_project_id=>"unavailable possibly not using Vertex", :llm_dimensions=>nil, :article_size=>412, :poly_field=>"title", :llm_embeddings_model_name=>"textembedding-gecko"}
[🧠] [v1/3] summary_embedding_description:
[🧠] As per bug https://github.com/palladius/gemini-news-crawler/issues/4 we can state this article belongs to titile/summary version: v3 (very few articles updated on 9oct24)

πŸ—Ώarticle.to_s

------------------------------
Title: Streaming LLM Responses
[content]
In this episode, we look at running a self hosted Large Language Model (LLM) and consuming it with a Rails application. We will use a background to make API requests to the LLM and then stream the responses in real-time to the browser.
[/content]

Author: Dave Kimura
PublishedDate: 2024-03-03
Category: Technology
NewsPaper: Drifitng ruby
{"id"=>4206,
"title"=>"Streaming LLM Responses",
"summary"=>nil,
"content"=>"In this episode, we look at running a self hosted Large Language Model (LLM) and consuming it with a Rails application. We will use a background to make API requests to the LLM and then stream the responses in real-time to the browser.",
"author"=>"Dave Kimura",
"link"=>"https://www.driftingruby.com/episodes/streaming-llm-responses",
"published_date"=>Sun, 03 Mar 2024 00:00:00.000000000 UTC +00:00,
"image_url"=>nil,
"feed_url"=>"https://www.driftingruby.com/episodes/streaming-llm-responses",
"language"=>nil,
"active"=>true,
"ricc_source"=>"feedjira::v1",
"created_at"=>Mon, 01 Apr 2024 20:13:18.804903000 UTC +00:00,
"updated_at"=>Mon, 21 Oct 2024 18:02:37.998198000 UTC +00:00,
"newspaper"=>"Drifitng ruby",
"macro_region"=>"Technology"}
Edit this article
Back to articles