Post
Topic
Board Project Development
Re: Bitcointalk Search Project - Vod's Version
by
Joel_Jantsen
on 16/02/2025, 22:01:47 UTC
Yes, that is what I'm planning - they can chat with it just on bct subjects.   

I don't think any LLM would parse the entire bitcoin forum.  I'm not sure of the methods people use to populate, but I don't think it would be a spider crawl.  Instead, they would contact Google and other agencies for data dumps they could use.    The only way we can get a detailed LLM on bct is to train it ourselves.

I think the first thing to do will be create a small team.  Members will include content creators (parsers), the GUI designer and coders, both general compute and ML.  I would join the team just for my general knowledge of AI, but not as any kind of coder or policy maker.  The goal will be to created an uncensored search engine for this forum.   By uncensored, I mean deleted posts would be included, and not that private areas should be parsed.   Any search engine must respect the public visibility.

Does anyone want to champion this idea?
Got you! The fastest way to get access to the forum's public data would be pinging theymos and seeing if he wants to give the whole data dump from the database. Although, highly unlikely, that would be faster than any of the existing coumminty-created data sources. The LLM part should be pretty easy once the data is available as one would train it using any of the 100 open-source models. You'd need to train the model with updated information every few weeks to keep it ready with the latest information. For instance, what scam projects were flagged last week, if asked, the chatbot wouldn't have access to this information.

I'm not super into Data Science or ML side of things but if we had the data information, I could get this ready within a week or two max. Lot of readily available frameworks that provide plug-and-play interfaces for such bots.