Post
Topic
Board Project Development
Re: Bitcointalk Search Project - Vod's Version
by
NeuroticFish
on 14/02/2025, 20:28:05 UTC
Hi Vod, if I understood correctly, you want to create an LLM based on the Bitcointalk forum data you've parsed? The LLM could be trained on this data and users can chat with it just like ChatGpt. Doesn't ChatGpt already scrape the web pages citing sources and have access to all the publicly available data? Unless the data we're talking about is private I guess any of these LLMs that scan the webpages for information should act as a search engine. We can create something just inclined towards Bitcointalk and trained exclusively on forum-related information as a hobby project.

Normally the public AI only uses old data - years or months old. If Vod can feed it with very new data, it's already a huge step forward.

I really have no idea how the machine part of deepseek works - I'm actually taking a beginner course on machine learning so I can understand what this is doing

You may want to also read about LLM hallucinations. I don't know how badly is DeepSeek affected, but I've seen some incredibly bad ones at Gemini and Copilot.