Last year around this time, over a warm cup of hot cocoa, I reflected on the NotPetya cyberattack, a global catastrophe that reshaped how we
Introduction Back in 2013, I began blogging about Big Data, diving into the ways massive data volumes and new technologies were transforming industries. Over the
In 2019, we explored the foundations of neural networks—how layers of interconnected nodes mimic the human brain to extract patterns from data. Since then, one
“Before machines can understand us, they need to know where one word ends and another begins.” 🧠 Introduction: Why Tokenization Matters Natural Language Processing (NLP)
Introduction: Enhancing Trino Performance In our journey with Trino, we’ve explored its setup, integrated it with multiple data sources, added real-time data, and expanded to
Introduction: Scaling Data with Cloud Storage In the previous blogs, we explored building a sample project locally, optimizing queries, and adding real-time data streaming. Now,
Introduction: Building on the Basics In our last blog, we set up a local Trino project for a sample use case—Unified Sales Analytics—allowing us to
Why a Trino Series Instead of Presto? If you followed the initial post in this series, you may recall we discussed the history of Presto
Solr is the popular, blazing-fast, open-source enterprise search platform built on Apache Lucene™. Here is a example of how Solr might be integrated into an