Choose- What’s right for you… Not what you are lured with!! We are often confused by our needs. Sometimes we do not have options and have to trust what we get as the best and some other times we are showered with so many offers that we are not sure whether what we are choosing is actually what we want. Brand names, attractive features, extravagant style often blinds our judgment and we end up with a rather overpriced superficially complex, and high-maintenance seeking product. Typical MDM Project – Business Value realization The goal of any MDM implementation…
-
-
Weekend started, pored myself a glass of Long Meadow Ranch Anderson Valley Pinot Noir. It smelled like cherry cola, cinnamon, and a forest in autumn. Probably not the right time to think or even blog about OLAP. – Kinshuk Dutta Online analytical processing, or OLAP Is an approach to answer multi-dimensional analytical (MDA) queries swiftly in computing. OLAP is part of the broader category of business intelligence, which also encompasses relational databases, report writing and data mining. Typical applications of OLAP include business reporting for sales, marketing, management reporting, business process management (BPM), budgeting and forecasting, financial reporting and similar…
-
Uncategorized
Social CRM – A Cult
Am I Becoming Vulnerable – Social CRM The more information one puts on the net the more vulnerable he/she is becoming. Earlier crooks were playing with it illegally but now ethically people are using information pertaining to someone available on WWW to analyze, understand, and peruse the person for one’s own gain. Does it sound weird? Actually not. That’s what Social CRM is all about. A couple of years back everybody was talking about Social CRM. The hype was much more than the promo of Yash Raj Films. At that time things were much mystified and mystical, so what exactly…
-
What are the different classification of Data? Data can be widely classified as: Transactional Data Master Data Reference Data Meta Data Analytical Data What is Master Data? Master data is typically persistent, non-transactional data utilized by multiple systems that define the primary business entities. Master Data may include data about customers, products, employees, inventory, suppliers, and sites. What is Data Profiling? Data profiling is the process of examining the data available from an existing information source and collecting statistics or informative summaries about that data. They are widely categorized as Structure discovery Content Discovery & Relationship Discovery In order to…
-
What is Master Data Management It’s the process of managing “Master Data”. Master Data Management is a technology-enabled discipline in which business and Information Technology work together to ensure the uniformity, accuracy, stewardship, semantic consistency, and accountability of the enterprise’s official shared master data assets. Wikipedia What is Master Data Enterprize data can be broadly categorized as : Transactional data are the elements that support the on-going operations of an organization and are included in the application systems that automate key business processes. This can include areas such as sales, service, order management, manufacturing, purchasing, billing, accounts receivable, and accounts…
-
In my recent post I tried explaining how different data collection mechanisms are available and how due to modern day requirement, modern data lakes were formed. Iceberg is one such solution that came out really strong. What is Apache ICEBERG? Apache Iceberg is an open table format for huge analytic datasets. Iceberg adds tables to Trino and Spark that use a high-performance format that works just like a SQL table. With special emphasis on User Experience Reliability and Performance & Open Standards What makes it special is its unique table design for big data. This is explained brilliantly and covered well…
-
What is Apache Superset? Apache Superset is a modern, enterprise-ready business intelligence web application. It is fast, lightweight, intuitive, and loaded with options that make it easy for users of all skill sets to explore and visualize their data, from simple pie charts to highly detailed deck.gl geospatial charts. Why use Apache Superset? Apache Superset Open Source BI: almost the alternative to Tableau by Susana Santos 2018-09-17 In a recent blog posted on Jan 19. Dropbox explained why they selected Superset. This matrix from the same blog explains it in a nutshell. The entire blog is accessible over here. Installing Superset…
-
Introduction: My Journey into Presto My interest in Presto was sparked in early 2021 after an enriching conversation with Brian Luisi, PreSales Manager at Starburst. His insights into distributed SQL query engines opened my eyes to the unique capabilities and performance advantages of Presto. Eager to dive deeper, I joined the Presto community on Slack to keep up with developments and collaborate with like-minded professionals. This blog series is an extension of that journey, aiming to demystify Presto and share my learnings with others curious about distributed analytics solutions. What is PRESTO Presto is a high performance, distributed SQL query…
-
Data Lake The modern enterprise runs on data. However storing the same has always been challenging, expensive and it results in data silos. A data lake consists of a cost-effective and scalable storage system along with one or more compute engines. Data Lakes are consolidated, centralized storage areas for raw, unstructured, semi-structured, and structured data, taken from multiple sources and lacking a predefined schema. Data Lakes have been created to save data that “may have value.” It supports a broad range of essential functions from traditional decision support to business analytics to data science. The value of data and the insights…
-
In recent years, data storage has undergone significant transformation. While data lakes have become central to modern data architecture, a new contender has emerged: the data lakehouse. With its blend of traditional data lake flexibility and data warehouse reliability, the lakehouse model aims to address some of the challenges that data lakes face today, including data integrity and workload diversity. This blog explores the evolution from data lakes to data lakehouses and highlights key differences that are redefining how organizations manage their data. The Role of Data Lakes in Modern Data Management A data lake is a centralized repository designed…