Streaming LLM Responses
Semantically Similar Articles (by :title_embedding)
- 46.2 Mar29 GKE + Gemma + Ollama: The Power Trio for Flexible LLM Deployment (Federico Iezzi)
- 47.4 Feb05 Visualize PaLM-based LLM tokens (Guillaume Laforge)
- 50.9 Mar27 Enrich your streaming data using Bigtable and Dataflow (Reza Rokni)
- 51.4 2023Dec18 Hands on Codelabs to dabble with Large Language Models in Java (Guillaume Laforge)
- 52.0 Apr16 Fine tuning Gemma with LoRA on GCP (pritam sahoo)
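The scores above are nearest-neighbor distances between :title_embedding vectors (the list is sorted ascending, so lower appears to mean more similar). A minimal sketch of such a lookup, assuming a pgvector-backed Rails setup with the neighbor gem; the model, column, and distance metric are assumptions, not this crawler's confirmed code:

```ruby
# Hypothetical sketch: nearest-neighbor search over a pgvector column with
# the "neighbor" gem. The column name and metric are assumed; the metric
# behind the 46.2..52.0 scores above is not stated on this page.
class Article < ApplicationRecord
  has_neighbors :title_embedding
end

# Five closest articles to this one by title embedding; neighbor_distance
# is the score shown next to each result.
article.nearest_neighbors(:title_embedding, distance: "euclidean").first(5).each do |a|
  puts format("%.1f %s (%s)", a.neighbor_distance, a.title, a.author)
end
```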
Streaming LLM Responses
2024-03-03
- Dave Kimura
(from Drifting Ruby)
In this episode, we look at running a self-hosted Large Language Model (LLM) and consuming it with a Rails application. We will use a background job to make API requests to the LLM and then stream the responses in real time to the browser.
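A minimal sketch of that flow, assuming an Ollama-style local endpoint and Turbo Streams for the browser push; the class, model, and element names here are hypothetical, not the episode's actual code:

```ruby
require "net/http"
require "json"

# Hypothetical sketch (not the episode's code): a background job that
# streams tokens from a self-hosted LLM (assuming Ollama's /api/generate
# on its default port) and appends each fragment to the page via Turbo Streams.
class LlmStreamJob < ApplicationJob
  queue_as :default

  def perform(prompt, stream_id)
    uri = URI("http://localhost:11434/api/generate")
    req = Net::HTTP::Post.new(uri, "Content-Type" => "application/json")
    req.body = { model: "mistral", prompt: prompt, stream: true }.to_json

    Net::HTTP.start(uri.host, uri.port) do |http|
      http.request(req) do |response|
        response.read_body do |chunk|
          # Ollama streams one JSON object per line. Simplified: assumes
          # each chunk holds whole lines; a real job would buffer partials.
          chunk.each_line do |line|
            token = JSON.parse(line)["response"] rescue next
            Turbo::StreamsChannel.broadcast_append_to(
              stream_id, target: "llm_response", html: token
            )
          end
        end
      end
    end
  end
end
```

The page would subscribe with `turbo_stream_from stream_id` and render a target element such as `<div id="llm_response">`; running the long-lived HTTP read in a job keeps it out of the web request cycle.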
[v1/3] title_embedding_description: {:ricc_notes=>"[embed-v3] Fixed on 9oct24. Only seems incompatible at first glance with embed v1.", :llm_project_id=>"unavailable possibly not using Vertex", :llm_dimensions=>nil, :article_size=>412, :poly_field=>"title", :llm_embeddings_model_name=>"textembedding-gecko"}
[v1/3] summary_embedding_description:
As per bug https://github.com/palladius/gemini-news-crawler/issues/4, we can state this article belongs to title/summary version v3 (very few articles updated on 9oct24).
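For context, textembedding-gecko named above is a Vertex AI text-embedding model. A hedged sketch of how a title embedding could be fetched from its REST predict endpoint; the project id and auth shown are assumptions (the metadata above marks the project as unavailable), and the crawler's own pipeline may differ:

```ruby
require "net/http"
require "json"

# Hypothetical sketch: fetch an embedding for an article title from the
# Vertex AI textembedding-gecko predict endpoint. PROJECT_ID and the
# gcloud-based token are assumptions for illustration only.
def title_embedding(title, project_id:, token:)
  uri = URI("https://us-central1-aiplatform.googleapis.com/v1/projects/#{project_id}" \
            "/locations/us-central1/publishers/google/models/textembedding-gecko:predict")
  req = Net::HTTP::Post.new(uri, "Authorization" => "Bearer #{token}",
                                 "Content-Type" => "application/json")
  req.body = { instances: [{ content: title }] }.to_json
  res = Net::HTTP.start(uri.host, uri.port, use_ssl: true) { |http| http.request(req) }
  # One embedding per instance, returned as an array of floats.
  JSON.parse(res.body).dig("predictions", 0, "embeddings", "values")
end

vector = title_embedding("Streaming LLM Responses",
                         project_id: ENV.fetch("PROJECT_ID"),
                         token: `gcloud auth print-access-token`.strip)
```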
article.to_s
------------------------------ Title: Streaming LLM Responses [content] In this episode, we look at running a self-hosted Large Language Model (LLM) and consuming it with a Rails application. We will use a background job to make API requests to the LLM and then stream the responses in real time to the browser. [/content] Author: Dave Kimura PublishedDate: 2024-03-03 Category: Technology NewsPaper: Drifting Ruby
"title"=>"Streaming LLM Responses",
"summary"=>nil,
"content"=>"In this episode, we look at running a self hosted Large Language Model (LLM) and consuming it with a Rails application. We will use a background to make API requests to the LLM and then stream the responses in real-time to the browser.",
"author"=>"Dave Kimura",
"link"=>"https://www.driftingruby.com/episodes/streaming-llm-responses",
"published_date"=>Sun, 03 Mar 2024 00:00:00.000000000 UTC +00:00,
"image_url"=>nil,
"feed_url"=>"https://www.driftingruby.com/episodes/streaming-llm-responses",
"language"=>nil,
"active"=>true,
"ricc_source"=>"feedjira::v1",
"created_at"=>Mon, 01 Apr 2024 20:13:18.804903000 UTC +00:00,
"updated_at"=>Mon, 21 Oct 2024 18:02:37.998198000 UTC +00:00,
"newspaper"=>"Drifitng ruby",
"macro_region"=>"Technology"}