As I understand, it will be like DeepSeek. It learns the whole forum, every post and when you ask a question, it will try to answer you from all the information that Bitcointalk carries, right? It sounds interesting but I wonder how it will be able to filter information well, there are many wrong and many right answers.
Correct - it has the same
model so it will provide the same type of responses DS does, however it's
knowledge will be limited to the posts from this forum. I don't think anyone would use it for crypto research, but it's useful for crypto activities on this forum. For example: a person would not use it to discuss improved methods of storing key phrases, but could use it to get of list of key phrase discussions that involve a certain offline wallet. It could produce probability that two users are the same, based on their posts or blockchain activity, or give a loan risk rating of a user.
This is where the project can pay for itself. We can offer community members so many tokens per day, and charge for excess or non-personal use. This forum was a good development tool in it's non-greedy days, and many developers still look to it.
Btw I like the idea, I can work on UI/UX design.
The UI already looks just like deepseek. You will need to figure out what controls will work best - we don't want to complicate it so we don't need to update it. For example, is it worth it to have a dropdown multi-select showing the categories? Then you can ask the AI your question, and add "in these categories only". ?
I have about half of the forum's posts (as of 2025-01-01), it will take a few weeks for me to fetch the other half.
Your new data set contains a copy of the recent forum posts. Archived copies contain an older data set. By comparing these datasets, the AI can determine deleted posts, edited posts, etc. I'm curious about the parameters you are collecting, but I'll discuss that in your thread. I'd like this thread to stay about the actual search engine/AI, and not the data collection behind it.