Train ChatGPT on my forum DB

dvduval

Active member
Is there such a tool available yet or has anyone done it. There is so much content on my forum from over 22 years! I would love to ask questions to my forum!
 
Do you use elastic search? It gives great search results for your members on your own site.

I think ChatGPT and other AI/AI models are not good for our forums. They grab content and potential members then find the content directly on the AI search engine sites or on ChatGPT or similar. Why should users still visit your site?
I don't think external AI will have a positive effect on our forums.

The other point is the law - copyright and not one KI pays for our members content they use to train there KI models.

It's like raising and feeding your own executioner... ;)
 
Do you use elastic search? It gives great search results for your members on your own site.

I think ChatGPT and other AI/AI models are not good for our forums. They grab content and potential members then find the content directly on the AI search engine sites or on ChatGPT or similar. Why should users still visit your site?
I don't think external AI will have a positive effect on our forums.

The other point is the law - copyright and not one KI pays for our members content they use to train there KI models.

It's like raising and feeding your own executioner... ;)

I am talking about training on my DB only, not sharing the content outside the forums, though I'm sure the spiders have already paid a visit to my forums. I can't control that except if I close the forums to the public.
 
I don't think external AI will have a positive effect on our forums.
But this is done in any case, except you hide all content from public. Which makes no sense.

The other point is the law - copyright and not one KI pays for our members content they use to train there KI models.
Copyright (here in DE it is UrhG) has no concern in most cases, cause most (tm) forum posts are not proteced by (DE, EU) law.
Pictures are a different topic.

It's like raising and feeding your own executioner... ;)

That is,. what the AI companies are doing trying.

A Chatbot, which "learned" from a forum database could be an interesting tool for use inside of a forum.
Or see it as a toy. Hey, it has "AI" in the name, has to be a cool thing anyway ;)

I am also interested in this topic. And if there is a easy working solution, i would try this and offer it to my members to play with it.
Using a "Chatbot" as a "Chatbot", where human interaction is the appropriate way (e.g. moderating etc.), is nothing, i like or understand as a step forward.
 
But this is done in any case, except you hide all content from public. Which makes no sense.


Copyright (here in DE it is UrhG) has no concern in most cases, cause most (tm) forum posts are not proteced by (DE, EU) law.
Pictures are a different topic.



That is,. what the AI companies are doing trying.

A Chatbot, which "learned" from a forum database could be an interesting tool for use inside of a forum.
Or see it as a toy. Hey, it has "AI" in the name, has to be a cool thing anyway ;)

I am also interested in this topic. And if there is a easy working solution, i would try this and offer it to my members to play with it.
Using a "Chatbot" as a "Chatbot", where human interaction is the appropriate way (e.g. moderating etc.), is nothing, i like or understand as a step forward.

Right, play with it and understand it. Don't just blanketly reject it. I already made a tool (with some help) to query the database of my CRM software, and it is working pretty cool. It's useful, but you can do analysis that would normally need a pragrammer.
 
I already made a tool (with some help) to query the database of my CRM software, and it is working pretty cool. It's useful, but you can do analysis that would normally need a pragrammer.
AI can produce a lot of automatic analysis. That is particularly useful, if you have to publish some ********bingo PowerPoint sheets. Or, in my case working with generative AI, producing funny background pictures.

The question (as always) is, what is your aim, how does a new technical option brings your closer to that and what is the benefit.

Forums are not about statistics. A forum is based on interaction of humans (more or less). So a tool (like AI-whatever) should make this interaction, communication, sharing knowledge ... better. If it works this way, fine. If not - i am not interested.
 
AI can produce a lot of automatic analysis. That is particularly useful, if you have to publish some ********bingo PowerPoint sheets. Or, in my case working with generative AI, producing funny background pictures.

The question (as always) is, what is your aim, how does a new technical option brings your closer to that and what is the benefit.

Forums are not about statistics. A forum is based on interaction of humans (more or less). So a tool (like AI-whatever) should make this interaction, communication, sharing knowledge ... better. If it works this way, fine. If not - i am not interested.

Lots of statistics I would love to have without having to write code every time:
  1. How many times was "keyword" used in 2022? And which forums (nodes) used the keyword the most?
  2. Who are the top users for March in the X forum?
  3. Can you find all posts where an email address was posted?
  4. What are members saying about Topic X.
  5. I know many people have describes how to do X over the years. Based on forum posts, how should I do X. Please provide examples with links to posts.
  6. Based on the Google Search Console data (that also connected), which topics are getting traffic but not being discussed?
  7. Can you help me find conversations that appear to be sexually explicit?
  8. Can you give me some idea about modifying this plugins code?
  9. Here is the code of Plugin X. I want to make a similar plugin that does Y.
Just tons of things that are possible. It is a little scary, but understanding something is key to being less afraid.
 
Just tons of things that are possible. It is a little scary, but understanding something is key to being less afraid.

Except maybe 3. and 7. (what is managed by moderators), there is nothing i am interested in. And I do not see, what helps this data to make forum discussions better. For example 1. 2. 4. - if you have the data, what would be your next steps and what is the goal?
 
More-so asking the OP…….The reason I ask is many people have training models backwards. Many want to train an OpenAI model from scratch using 100% their data…..and it sounds like this is what the OP is wishing to do, but that’s not how OpenAI has things setup.

OpenAI is intending you to use mostly their knowledge BUT you can “fine tune” things to fit your needs. You train the model on the tone or the length of response…..not necessarily the knowledge it uses to respond.

Now for some niches that may use certain information OpenAI may not really have that knowledge. Let’s take as an example a forum based on knifes, maybe you have a list of where certain knife makers live, what wood they use for their handles, or metals used in their blades……all this is info that openAI may not have but it maybe important to your users. In this case using their Assistant API and turning on the retrieval function allows you to upload files that uses this detailed information. When this occurs the model will access the files and respond using your proprietary info. Plus, according to their terms, this info will NOT be used for training OpenAI’s future models.


Now looking at the OP’s later comments it’s clear he’s got other ideas than I figured. I do foresee this possible in the future, but man I’d hate to see the computing power needed to preform the tasks. That said, you may want to look at this plugin by @ThemeHouse as it does help monitor certain items like #7

https://www.themehouse.com/xenforo/2/addons/google-perspective
 
Let’s take as an example a forum based on knifes, maybe you have a list of where certain knife makers live, what wood they use for their handles, or metals used in their blades……all this is info that openAI may not have but it maybe important to your users.
ACK, Exactly. And a so trained model does not have a database, which has nothing to to with the intended use.
 
A Chatbot, which "learned" from a forum database could be an interesting tool for use inside of a forum.
Do you realy believe, the goal will be to make a free tool for forums?
Why should google have at long term sight a interest on your forum?
Why should a user come in your forum, when a AI can give him the answers of his questions?
Or, in my case working with generative AI, producing funny background pictures.
lol - yeah, maximum helpfull (y);)

How many times was "keyword" used in 2022? And which forums (nodes) used the keyword the most?
Matomo

Who are the top users for March in the X forum?
Matomo

Can you find all posts where an email address was posted?
??? do you dont have ever used the forum search?
Do you know Elastic Search and the add-ons for from Xon?

I know many people have describes how to do X over the years. Based on forum posts, how should I do X. Please provide examples with links to posts.
Elastic Search and switch on own brain? ;-)

Based on the Google Search Console data (that also connected), which topics are getting traffic but not being discussed?
Matomo

Can you help me find conversations that appear to be sexually explicit?
You realy will feed a KI/AI with your members privat messages??? OMG ...
If you will prevent, use the Xenforo core function for stop words.

Can you give me some idea about modifying this plugins code?
Yeah, use it and never understand what the KI presents you will make a forum more stable. OMG... #2

Here is the code of Plugin X. I want to make a similar plugin that does Y.
Oh cool - maximum fun for jurists when you use in worst case other peoples code from a KI answer. I dont think the Ki pays for or go in worst case in jail for you.
I know - worst case. But possible in a time a KI feeds and later sells every bodys content without having rights for.

Just tons of things that are possible. It is a little scary, but understanding something is key to being less afraid.
No, its not scary - its blue-eyed and short-sighted. Sorry.

What may I ask is the niche of your forums?
Garden and Koi ponds (many pictures and and specialist articles) and tractors of a particular brand (many technical specialist content and pictures). 3 forums, not to make money.


Now for some niches that may use certain information OpenAI may not really have that knowledge. Let’s take as an example a forum based on knifes, maybe you have a list of where certain knife makers live, what wood they use for their handles, or metals used in their blades……all this is info that openAI may not have but it maybe important to your users. In this case using their Assistant API and turning on the retrieval function allows you to upload files that uses this detailed information. When this occurs the model will access the files and respond using your proprietary info. Plus, according to their terms, this info will NOT be used for training OpenAI’s future models.
Ok, that's nice for the AI and maybe also for some "simple" users.
But what do I get out of it as the provider of the information? Why should users still get involved in forums or groups when an AI can supposedly answer all questions?
That brings me back to my core message - if you feed the AI, then as a forum operator you are feeding your own executioner

Don't get me wrong, I'm certainly not fundamentally against AI applications. For example, this can bring great progress with translators or in research and science or medicin (to find better therapie methodes eg.).

But AI news and supposed specialist answers... that's not what I would like AI to be a core task. Because at the end of the day the computers and servers have to be paid for, which opens the door to manipulation on an even larger scale than is already possible today.
I consider uncontrolled AI applications to be Pandora's Box in our time. If we open these without thinking, it could also be the end of our current society.

I know people like to laugh at it, but basically we're laying the foundation for a real SkyNet a la Terminator.
It may be that the developers of the AI systems only want good things - that's certainly what they wanted in the beginning with nuclear fission... ;-)
 
That brings me back to my core message - if you feed the AI, then as a forum operator you are feeding your own executioner


AI is already parsing our forums, assuming the content is public, so the executioner is in the building.

AI can be used in your own controlled environment, and do some useful things. PHP could be compared to it. But the AI can allow you to use natural language to get results. That aspect of AI is not at all scary to me.
 
Top Bottom