this post was submitted on 28 Oct 2024

[–] Zos_Kia@lemmynsfw.com 1 points 1 month ago (1 children)

What kind of use cases were they, where you didn't find suitable local models to work with? I've found that general "chatbot" things are hit and miss, but more domain-constrained tasks (such as extracting structured entities from unstructured text) are pretty reliable even on smaller models. I'm not counting my chickens yet as my dataset is still somewhat small, but preliminary testing has been very promising in that regard.

[–] xavier666@lemm.ee 2 points 1 month ago (1 children)

What kind of use cases were they, where you didn’t find suitable local models to work with?

Any time you ask very domain-specific questions, e.g. "I have collected some soil samples from the Mesolithic age near the Amazon basin which have high sulfur and phosphorus content compared to my other samples. What factors could contribute to this distribution?", both off-the-shelf local models and OpenAI fail.

The main reason is that these models are not trained on highly specialized text. Sometimes the models start hallucinating, which reduces our trust in them.

[–] Zos_Kia@lemmynsfw.com 2 points 1 month ago

“I have collected some soil samples from the Mesolithic age near the Amazon basin which have high sulfur and phosphorus content compared to my other samples. What factors could contribute to this distribution?”

Haha yeah, the top execs were tripping balls if they thought some off-the-shelf product would be able to answer this kind of expert question. That's like trying to replace an expert craftsman with a 3D printer.