July 7, 2025

Category: Big Data

Maximize Data Quality and Insights
Big Data

Maximize Data Quality and Insights

A former colleague recently asked me to explain my role at Precisely. After my (admittedly lengthy) explanation of what I do as the EVP and GM of our Enrich business, she summarized it in a very succinct, but new way: “Oh, you manage the appending datasets.” That got me thinking. We often use different terms […]

Read More
Qumulo Expands Its Cloud Data Fabric with AI-Powered NeuralCache
Big Data

Qumulo Expands Its Cloud Data Fabric with AI-Powered NeuralCache

Shutterstock Qumulo, an enterprise data management and storage company, has unveiled a new predictive caching solution called Qumulo NeuralCache. The new tool is designed to supercharge data performance for AI-driven enterprise applications and critical line-of-business workloads. It leverages AI and machine learning (ML) models to dynamically optimize read/write caching across cloud and on-premises environments.  The […]

Read More
The Power of Fine-Tuning on Your Data
Big Data

The Power of Fine-Tuning on Your Data

Summary: LLMs have revolutionized software development by increasing the productivity of programmers. However, despite off-the-shelf LLMs being trained on a significant amount of code, they are not perfect. One key challenge for our Enterprise customers is the need to perform data intelligence, i.e., to adapt and reason using their own organization’s data. This includes being able to use […]

Read More
Recipes to Vectors: Using OpenSearch as Vector Database
Big Data

Recipes to Vectors: Using OpenSearch as Vector Database

Learn how to use OpenSearch as a vector database in this hands-on guide. Explore hybrid search, work with low-level APIs, and build vector-powered applications combining keyword and semantic search. In a previous blog, we introduced vector search using OpenSearch and Elasticsearch. We looked at the differences between keyword and semantic search and explored how vectors […]

Read More
How Big Data is Revolutionizing Waste Management in Tulsa
Big Data

How Big Data is Revolutionizing Waste Management in Tulsa

In recent years, cities across the United States have increasingly turned to technology to address their growing waste management challenges. One of the most promising innovations is the use of big data. The city of Tulsa is no exception, as big data plays an increasingly important role in transforming how waste is managed. By leveraging […]

Read More
Manage concurrent write conflicts in Apache Iceberg on the AWS Glue Data Catalog
Big Data

Manage concurrent write conflicts in Apache Iceberg on the AWS Glue Data Catalog

In modern data architectures, Apache Iceberg has emerged as a popular table format for data lakes, offering key features including ACID transactions and concurrent write support. Although these capabilities are powerful, implementing them effectively in production environments presents unique challenges that require careful consideration. Consider a common scenario: A streaming pipeline continuously writes data to […]

Read More
Vectara Launches Open Source Framework for RAG Evaluation
Big Data

Vectara Launches Open Source Framework for RAG Evaluation

Palo Alto, April 8, 2025 – Vectara, a platform for enterprise Retrieval-Augmented Generation (RAG) and AI-powered agents and assistants, today announced the launch of Open RAG Eval, its open-source RAG evaluation framework. The framework, developed in conjunction with researchers from the University of Waterloo, allows enterprise users to evaluate response quality for each componentand configuration […]

Read More
Maximize SEO Success with Powerful Data Analytics Insights
Big Data

Maximize SEO Success with Powerful Data Analytics Insights

Struggling to improve your ROI? Learn about the key metrics to consider when monitoring the progress of your SEO operations here. Analytics is the driver of maximizing the success of SEO initiatives. We talked about this in our article on the benefits of data-driven SEO. In a market that’s projected to expand from $89.1 billion […]

Read More
Digital Archiving: 5 Best Practices
Big Data

Digital Archiving: 5 Best Practices

As organizations generate and manage more digital content, having a strong digital archiving strategy is essential. Regulations continue to change, customer expectations continue to grow, and businesses must balance accessibility with security. A poorly managed archiving system can lead to compliance risks, data silos, and inefficiencies that slow down operations. In this blog, we’ll look […]

Read More