Kordian Zadrożny

AI, Websites, Programming, Databases

AI Hallucinations Unveiled: Why ChatGPT Makes Things Up and How We Can Fix It

I recently read a very interesting paper (https://arxiv.org/pdf/2401.01313v1) titled "Why Language Models Hallucinate" by Adam Tauman Kalai, Ofir Nachum, Santosh S. Vempala, and Edwin Zhang. The authors analyze the causes of so-called "hallucinations," which are situations where a language model (LLM, or popularly, AI) says something that sounds absolutely credible but is completely untrue.

Have you ever asked an AI chatbot a question and received a beautifully phrased, confident... falsehood? It might be a made-up book title, a non-existent historical fact, or, as in one study's example, three different, incorrect birth dates for the same person. This phenomenon, known in the industry as "hallucination," is one of the biggest barriers to fully trusting artificial intelligence.

A new scientific paper sheds light on this problem, arguing that hallucinations aren't a mysterious glitch but a logical consequence of how we train and evaluate language models. In short: we ourselves have taught AI that guessing pays off.

The Original Sin of AI: Errors from the Training Stage

It all begins at the "pretraining" stage, when the model digests vast amounts of text from the internet to learn language patterns. The study's authors show that even with perfectly clean training data, statistics are relentless. They explain this with a clever comparison to a binary classification problem. Imagine the AI's task isn't to generate text, but to answer "true" or "false" to statements. It turns out that generating correct sentences is significantly harder than simply evaluating their correctness.

What's more, the researchers established a mathematical relationship: the error rate of a model's generated output is at least twice as high as its error rate in judging what is true and what is false. This is especially evident with facts that appear very rarely in the training data.
If information about someone's birth date appeared only once across the entire internet, the model statistically...
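
The two claims above, that generation errors are at least double the classification errors and that guessing pays off, can be made concrete with a few lines of arithmetic. What follows is a minimal sketch, not code from the paper. The function names are hypothetical, the bound is only the simplified "at least twice" form quoted in this post (the paper's exact statement may carry additional terms), and the scoring rule is an assumption: a benchmark that awards 1 point for a correct answer and 0 points for both a wrong answer and "I don't know."

```python
def generation_error_lower_bound(classification_error: float) -> float:
    """Simplified bound as quoted in the post (an assumption, not the
    paper's exact formula): generation error >= 2 * classification error."""
    if not 0.0 <= classification_error <= 1.0:
        raise ValueError("classification_error must be between 0 and 1")
    # An error rate cannot exceed 100%, so the bound is capped at 1.0.
    return min(1.0, 2.0 * classification_error)


def expected_binary_score(p_correct: float, guess: bool) -> float:
    """Expected score on one question under the assumed 0/1 grading:
    1 point if correct, 0 points if wrong or if the model abstains."""
    if not 0.0 <= p_correct <= 1.0:
        raise ValueError("p_correct must be between 0 and 1")
    # Guessing earns a point with probability p_correct; abstaining earns 0.
    return p_correct if guess else 0.0


if __name__ == "__main__":
    # A model that misjudges 5% of true/false statements.
    print(generation_error_lower_bound(0.05))
    # A question where the model has only a 20% chance of being right.
    print(expected_binary_score(0.2, guess=True))
    print(expected_binary_score(0.2, guess=False))
```

Under this assumed grading, a guess never scores worse than abstaining, even when the model is probably wrong, which is one way to read the claim that we have taught AI that guessing pays off.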

What do IT/AI, karate, and science fiction have in common? Welcome to my blog.

Hi, I'm Kordian. For over twenty years, the world of IT has been my natural environment. On a daily basis, I lead an IT team and design and build information systems. But when the workday ends, my passion for technology doesn't.

I've always been fascinated by connecting seemingly distant elements. What links the discipline I learned on the karate mat with designing IT systems? How can thinking about starships contribute to creating better business applications? And how is artificial intelligence changing the rules of the game not only in our professional lives but in our entire existence?

This blog is a space where I want to talk about just that. Without the corporate jargon – I'm an old-school guy, from a time before the great corporatization, and despite working in such organizations, some standards are still foreign to me. I want to share three areas that particularly interest me here:

Technology that makes real sense: A practical look at AI, ERP systems, and, more broadly, IT systems that support business. We'll analyze how technology can genuinely help companies, not just look impressive on slides. I want to show real-world applications of AI – not marketing promises that end with prototypes and nothing more (besides significant losses), but an understanding of how large language models work and the areas where they are truly worth using today.

The big questions: Science, space, and everything that makes us look at the stars and ask, "what's next?". This has been my constant fascination since I learned to read. Science and science fiction – I simply love them.

Creativity in action: I sometimes write science fiction short stories. Until now, they've ended up in a drawer, but I thought it was time to change that. I've decided to occasionally share the fruits of that creative joy here.

In short, this will be a blog about technology with a human face. To give you a taste of what we'll be discussing, here are a few news items that recently caught my attention....
