<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
xmlns:content="http://purl.org/rss/1.0/modules/content/"
xmlns:dc="http://purl.org/dc/elements/1.1/"
xmlns:atom="http://www.w3.org/2005/Atom"
xmlns:sy="http://purl.org/rss/1.0/modules/syndication/">
	<channel>
		<title>Author at Infinum</title>
		<atom:link href="https://infinum.com/blog/author/nikola-miladinovic/feed/" rel="self" type="application/rss+xml" />
		<link></link>
		<description>Building digital products</description>
		<lastBuildDate>Fri, 03 Apr 2026 12:58:20 +0000</lastBuildDate>
		<sy:updatePeriod>hourly</sy:updatePeriod>
		<sy:updateFrequency>1</sy:updateFrequency>

					<item>
				<image>
					<url>19273649https://infinum.com/uploads/2025/12/img-databricks-hero-min.webp</url>
				</image>
				<title>From RAG to Riches: Strengthening Your Cloud AI Foundation with Databricks</title>
				<link>https://infinum.com/blog/scaling-ai-with-databricks/</link>
				<pubDate>Tue, 13 Jan 2026 16:32:40 +0000</pubDate>
				<dc:creator>Nikola Miladinović</dc:creator>
				<guid isPermaLink="false">https://infinum.com/?p=19273649</guid>
				<description>
					<![CDATA[<p>When AI workloads scale, cloud-native tools fall short on governance, versioning, and observability. Databricks fills those gaps without replacing your cloud.</p>
<p>The post <a href="https://infinum.com/blog/scaling-ai-with-databricks/">From RAG to Riches: Strengthening Your Cloud AI Foundation with Databricks</a> appeared first on <a href="https://infinum.com">Infinum</a>.</p>
]]>
				</description>
				<content:encoded>
					<![CDATA[<div
	class="wrapper"
	data-id="es-206"
	 data-animation-target='inner-items'>
		
			<div class="wrapper__inner">
			<div class="block-blog-content js-block-blog-content">
	
<div class="block-blog-content-sidebar" data-id="es-92">
	</div>

<div class="block-blog-content-main">
	
<div
	class="wrapper wrapper__use-simple--true"
	data-id="es-95"
	 data-animation='slideFade' data-animation-target='inner-items'>
		
			<div class="block-typography" data-id="es-93">
	<p	class='typography typography--size-36-text js-typography block-typography__typography'
	data-id='es-94'
	>
	<strong>Your RAG system works great until it doesn&#8217;t. As AI workloads scale, cloud-native tools begin to show cracks in governance, versioning, and observability. We explore how Databricks fills these gaps without replacing your existing AWS or Azure infrastructure.</strong></p></div>	</div>

<div
	class="wrapper wrapper__use-simple--true"
	data-id="es-98"
	 data-animation='slideFade' data-animation-target='inner-items'>
		
			<div class="block-typography" data-id="es-96">
	<p	class='typography typography--size-16-text-roman js-typography block-typography__typography'
	data-id='es-97'
	>
	Most teams already run reliable AI workloads on AWS or Azure. These platforms come with mature services that power modern production systems. Azure OpenAI, Cognitive Search, Blob Storage, AWS Bedrock, OpenSearch, and S3 all support high-quality RAG architectures and handle identity, networking, scaling, and operational reliability with ease.</p></div>	</div>

<div
	class="wrapper wrapper__use-simple--true"
	data-id="es-101"
	 data-animation='slideFade' data-animation-target='inner-items'>
		
			<div class="block-typography" data-id="es-99">
	<p	class='typography typography--size-16-text-roman js-typography block-typography__typography'
	data-id='es-100'
	>
	But as AI systems grow, technical demands increase, data volumes expand, new document sources emerge, multiple teams work with the same information, and models evolve more frequently. That’s when cracks start to show. Cloud-native tools, built primarily for storage, compute, and serving, struggle to keep up. They lack unified governance, lineage tracking, and transformation pipelines needed to maintain consistency across growing AI workloads. </p></div>	</div>

<div
	class="wrapper wrapper__use-simple--true"
	data-id="es-104"
	 data-animation='slideFade' data-animation-target='inner-items'>
		
			<div class="block-typography" data-id="es-102">
	<p	class='typography typography--size-16-text-roman js-typography block-typography__typography'
	data-id='es-103'
	>
	The challenge then shifts from building a functional RAG system to orchestrating a governed data foundation, exactly what <strong><a href="https://infinum.com/artificial-intelligence/custom-solutions/">custom AI solutions</a></strong> are designed to address.</p></div>	</div>

<div
	class="wrapper wrapper__use-simple--true"
	data-id="es-106"
	 data-animation='slideFade' data-animation-target='inner-items'>
		
			<div class="block-highlighted-text">
	<p	class='typography typography--size-36-text js-typography block-highlighted-text__typography'
	data-id='es-105'
	>
	At Infinum, we use Databricks as part of our <strong><a href="https://infinum.com/artificial-intelligence/data-engineering/">data engineering practice</a></strong> to future-proof our clients&#8217; AI architecture. We&#8217;ll walk you through its core capabilities, <em>brick by brick</em>, to show you how they work together to help you scale your cloud AI with confidence.</p></div>	</div>

<div
	class="wrapper wrapper__use-simple--true"
	data-id="es-109"
	 data-animation='slideFade' data-animation-target='inner-items'>
		
			<div class="block-typography" data-id="es-107">
	<h2	class='typography typography--size-52-default js-typography block-typography__typography'
	data-id='es-108'
	>
	<strong>Unity Catalog: one layer to rule them all (your data, models, and vectors)</strong></h2></div>	</div>

<div
	class="wrapper wrapper__use-simple--true"
	data-id="es-112"
	 data-animation='slideFade' data-animation-target='inner-items'>
		
			<div class="block-typography" data-id="es-110">
	<p	class='typography typography--size-16-text-roman js-typography block-typography__typography'
	data-id='es-111'
	>
	<a href="https://www.databricks.com/product/unity-catalog" target="_blank" rel="noreferrer noopener">Unity Catalog</a> is the central governance and metadata layer of the Databricks platform. It brings data, models, vector indexes, and functions under a single, consistent structure, so everything is defined, tracked, and secured in one place. This means simplified permission management and the elimination of fragmentation caused by different services each maintaining their own access rules.</p></div>	</div>

<div
	class="wrapper wrapper__use-simple--true"
	data-id="es-115"
	 data-animation='slideFade' data-animation-target='inner-items'>
		
			<div class="block-typography" data-id="es-113">
	<p	class='typography typography--size-16-text-roman js-typography block-typography__typography'
	data-id='es-114'
	>
	 Unity Catalog also automatically captures lineage, making it easy to trace how data flows through each stage of your AI pipeline, from ingestion to preprocessing, embedding, retrieval, and inference.</p></div>	</div>

<div
	class="wrapper wrapper__use-simple--true"
	data-id="es-118"
	 data-animation='slideFade' data-animation-target='inner-items'>
		
			<div class="block-typography" data-id="es-116">
	<p	class='typography typography--size-16-text-roman js-typography block-typography__typography'
	data-id='es-117'
	>
	The result is a unified and predictable governance model that reduces complexity and supports reliable AI development across teams and cloud environments.</p></div>	</div>

<div
	class="wrapper wrapper__use-simple--true"
	data-id="es-121"
	 data-animation='slideFade' data-animation-target='inner-items'>
		
			<div class="block-typography" data-id="es-119">
	<h2	class='typography typography--size-52-default js-typography block-typography__typography'
	data-id='es-120'
	>
	<strong>From unversioned storage to reproducible data with Delta Lake</strong></h2></div>	</div>

<div
	class="wrapper wrapper__use-simple--true"
	data-id="es-124"
	 data-animation='slideFade' data-animation-target='inner-items'>
		
			<div class="block-typography" data-id="es-122">
	<p	class='typography typography--size-16-text-roman js-typography block-typography__typography'
	data-id='es-123'
	>
	With governance handled by Unity Catalog, the next layer to stabilize is storage itself. RAG systems thrive on structure and stability. But in practice, documents change frequently, models are retrained, and embeddings are regenerated. Without versioning and transactional integrity, it’s hard to explain model behavior or validate changes.</p></div>	</div>

<div
	class="wrapper wrapper__use-simple--true"
	data-id="es-127"
	 data-animation='slideFade' data-animation-target='inner-items'>
		
			<div class="block-typography" data-id="es-125">
	<p	class='typography typography--size-16-text-roman js-typography block-typography__typography'
	data-id='es-126'
	>
	Delta Lake solves this challenge by layering ACID guarantees, schema enforcement, and time travel on top of cloud storage. Each Delta table becomes a versioned source of truth for both structured data from databases and unstructured data like PDFs and HTML. Ingestion becomes predictable instead of brittle. Teams can replay experiments without guessing which files existed at a given point in time. Even unstructured content can be governed just like structured tables, using managed volumes.</p></div>	</div>

<div
	class="wrapper wrapper__use-simple--true"
	data-id="es-130"
	 data-animation='slideFade' data-animation-target='inner-items'>
		
			<div class="block-typography" data-id="es-128">
	<p	class='typography typography--size-16-text-roman js-typography block-typography__typography'
	data-id='es-129'
	>
	For teams prioritizing reproducibility and transparency, Delta Lake adds the versioning and transactional guarantees that object storage alone cannot provide.</p></div>	</div>

<div
	class="wrapper wrapper__use-simple--true"
	data-id="es-133"
	 data-animation='slideFade' data-animation-target='inner-items'>
		
			<div class="block-typography" data-id="es-131">
	<h2	class='typography typography--size-52-default js-typography block-typography__typography'
	data-id='es-132'
	>
	<strong>Why retrieval belongs next to your data</strong></h2></div>	</div>

<div
	class="wrapper wrapper__use-simple--true"
	data-id="es-136"
	 data-animation='slideFade' data-animation-target='inner-items'>
		
			<div class="block-typography" data-id="es-134">
	<p	class='typography typography--size-16-text-roman js-typography block-typography__typography'
	data-id='es-135'
	>
	With stable, versioned data in place, the next challenge is fast, reliable retrieval. Some engineering teams choose to complement their existing retrieval stack with Databricks Vector Search, especially when co-locating retrieval with the underlying data provides a performance or governance advantage. Integrating retrieval into the lakehouse platform offers several benefits:</p></div>	</div>

<div
	class="wrapper wrapper__use-simple--true"
	data-id="es-139"
	 data-animation='slideFade' data-animation-target='inner-items'>
		
			<div class="lists" data-id="es-137">
	<ul	class='typography typography--size-16-text-roman js-typography lists__typography'
	data-id='es-138'
	>
	<li><strong>Synchronized indexes: </strong>Vector indexes stay in sync with the Delta tables that feed them.</li><li><strong>Automatic embedding updates:</strong> Embeddings can be configured to refresh automatically when source documents change.</li><li><strong>Lower latency:</strong> Retrieval queries run in the same compute environment as the data, reducing round-trip times and response times.</li><li><strong>Consistent governance:</strong> Indexes inherit permissions, lineage, and catalog rules, keeping access control and tracking consistent.</li><li><strong>Easier evaluation workflows: </strong>Co-located retrieval is ideal for comparing embedding models or running offline simulations to detect drift.</li></ul></div>	</div>

<div
	class="wrapper wrapper__use-simple--true"
	data-id="es-142"
	 data-animation='slideFade' data-animation-target='inner-items'>
		
			<div class="block-media">
	<div	class="media block-media__media media__border--none media__align--center-center"
	data-id="es-140"
	 data-media-type='image'>

	<figure class="image block-media__image-figure image--size-stretch" data-id="es-141">
	<picture class="image__picture block-media__image-picture">
								
			<source
				srcset=https://infinum.com/uploads/2026/01/in-article-databricks_correction-2-1400x753.webp				media='(max-width: 699px)'
				type=image/webp								height="753"
												width="1400"
				 />
								
			<source
				srcset=https://infinum.com/uploads/2026/01/in-article-databricks_correction-2-2400x1291.webp				media='(max-width: 1199px)'
				type=image/webp								height="1291"
												width="2400"
				 />
												<img
					src="https://infinum.com/uploads/2026/01/in-article-databricks_correction-2.webp"
					class="image__img block-media__image-img"
					alt=""
										height="1338"
															width="2488"
										loading="lazy"
					 />
					</picture>

	</figure></div></div>	</div>

<div
	class="wrapper wrapper__use-simple--true"
	data-id="es-145"
	 data-animation='slideFade' data-animation-target='inner-items'>
		
			<div class="block-typography" data-id="es-143">
	<p	class='typography typography--size-16-text-roman js-typography block-typography__typography'
	data-id='es-144'
	>
	Unity Catalog handles governance, Delta Lake tracks every version from raw files to embeddings, and Databricks Vector Search continuously syncs with your data as it changes.</p></div>	</div>

<div
	class="wrapper wrapper__use-simple--true"
	data-id="es-148"
	 data-animation='slideFade' data-animation-target='inner-items'>
		
			<div class="block-typography" data-id="es-146">
	<p	class='typography typography--size-16-text-roman js-typography block-typography__typography'
	data-id='es-147'
	>
	For teams focused on performance, governance, and evaluation, this level of integration adds speed and structure to otherwise complex retrieval pipelines.<strong><br />
</strong></p></div>	</div>

<div
	class="wrapper wrapper__use-simple--true"
	data-id="es-151"
	 data-animation='slideFade' data-animation-target='inner-items'>
		
			<div class="block-typography" data-id="es-149">
	<h3	class='typography typography--size-52-default js-typography block-typography__typography'
	data-id='es-150'
	>
	<strong>Keep your models where your data is</strong></h3></div>	</div>

<div
	class="wrapper wrapper__use-simple--true"
	data-id="es-154"
	 data-animation='slideFade' data-animation-target='inner-items'>
		
			<div class="block-typography" data-id="es-152">
	<p	class='typography typography--size-16-text-roman js-typography block-typography__typography'
	data-id='es-153'
	>
	Getting data and retrieval right is only part of the equation. Now’s the time to plug in the models.&nbsp;</p></div>	</div>

<div
	class="wrapper wrapper__use-simple--true"
	data-id="es-157"
	 data-animation='slideFade' data-animation-target='inner-items'>
		
			<div class="block-typography" data-id="es-155">
	<p	class='typography typography--size-16-text-roman js-typography block-typography__typography'
	data-id='es-156'
	>
	Databricks Model Serving can help you deploy open-source foundation models, fine-tune custom variants, or run embedding models directly alongside their data, without bolting on separate infrastructure. Whether you&#8217;re working with large language models for generative AI or specialized embedding models for your RAG application, everything remains connected through Unity Catalog.</p></div>	</div>

<div
	class="wrapper wrapper__use-simple--true"
	data-id="es-160"
	 data-animation='slideFade' data-animation-target='inner-items'>
		
			<div class="block-typography" data-id="es-158">
	<p	class='typography typography--size-16-text-roman js-typography block-typography__typography'
	data-id='es-159'
	>
	You can track the entire lifecycle of a model from initial training to production deployment. This enables a multi-model strategy, allowing you to select the best tools for each use case without introducing operational complexity.</p></div>	</div>

<div
	class="wrapper wrapper__use-simple--true"
	data-id="es-163"
	 data-animation='slideFade' data-animation-target='inner-items'>
		
			<div class="block-typography" data-id="es-161">
	<h3	class='typography typography--size-52-default js-typography block-typography__typography'
	data-id='es-162'
	>
	<strong>No more duct-taping your AI pipelines together</strong></h3></div>	</div>

<div
	class="wrapper wrapper__use-simple--true"
	data-id="es-166"
	 data-animation='slideFade' data-animation-target='inner-items'>
		
			<div class="block-typography" data-id="es-164">
	<p	class='typography typography--size-16-text-roman js-typography block-typography__typography'
	data-id='es-165'
	>
	Modern retrieval-augmented generation workflows require more than just storage and compute. They need orchestration, monitoring, and continuous improvement loops. Databricks provides integrated tooling for the entire RAG architecture:</p></div>	</div>

<div
	class="wrapper wrapper__use-simple--true"
	data-id="es-169"
	 data-animation='slideFade' data-animation-target='inner-items'>
		
			<div class="lists" data-id="es-167">
	<ul	class='typography typography--size-16-text-roman js-typography lists__typography'
	data-id='es-168'
	>
	<li><strong>AI Playground:</strong> Quickly prototype and test different foundation models and prompts in an interactive environment. Experiment with how generative AI models respond using context from your data.</li><li><strong>Mosaic AI Agent Framework:</strong> Build <a href="https://infinum.com/artificial-intelligence/agent-development/" id="https://infinum.com/artificial-intelligence/agent-development/">intelligent agents</a> that go beyond simple Q&amp;A. These agents can perform complex, multi-step tasks by querying structured data, retrieving documents from vector stores, and synthesizing answers. For a deeper look at how agents connect to external systems, see our overview of <strong><a href="https://infinum.com/blog/model-context-protocols-mcp-ai-enabled-businesses/">Model Context Protocol and AI-enabled businesses</a></strong>.</li><li><strong>Databricks Workflows:</strong> Long-lived pipelines that ingest documents, clean them, segment them, embed them, index them, and validate them, all within the lakehouse. Keeping data-intensive steps in one place eliminates cross-service coordination overhead.<br />
</li></ul></div>	</div>

<div
	class="wrapper wrapper__use-simple--true"
	data-id="es-172"
	 data-animation='slideFade' data-animation-target='inner-items'>
		
			<div class="block-typography" data-id="es-170">
	<h3	class='typography typography--size-52-default js-typography block-typography__typography'
	data-id='es-171'
	>
	<strong>You can’t improve your system if you can’t observe it</strong></h3></div>	</div>

<div
	class="wrapper wrapper__use-simple--true"
	data-id="es-175"
	 data-animation='slideFade' data-animation-target='inner-items'>
		
			<div class="block-typography" data-id="es-173">
	<p	class='typography typography--size-16-text-roman js-typography block-typography__typography'
	data-id='es-174'
	>
	As RAG systems mature, observability becomes just as critical as modeling itself. Retrieval performance shifts gradually. Embeddings drift as data evolves. Large language model answers change with new versions.</p></div>	</div>

<div
	class="wrapper wrapper__use-simple--true"
	data-id="es-178"
	 data-animation='slideFade' data-animation-target='inner-items'>
		
			<div class="block-typography" data-id="es-176">
	<p	class='typography typography--size-16-text-roman js-typography block-typography__typography'
	data-id='es-177'
	>
	<a href="https://www.databricks.com/product/machine-learning/lakehouse-monitoring" target="_blank" rel="noreferrer noopener">Lakehouse Monitoring</a> lets you track everything from data quality to model behavior, all in one place. Instead of piecing together logs across disconnected services, you get a single, consolidated view of AI behavior in production, which pairs well with <strong><a href="https://infinum.com/blog/ai-data-visualization/">AI data visualization</a></strong> approaches for surfacing those insights to stakeholders.</p></div>	</div>

<div
	class="wrapper wrapper__use-simple--true"
	data-id="es-181"
	 data-animation='slideFade' data-animation-target='inner-items'>
		
			<div class="block-media">
	<div	class="media block-media__media media__border--none media__align--center-center"
	data-id="es-179"
	 data-media-type='image'>

	<figure class="image block-media__image-figure image--size-stretch" data-id="es-180">
	<picture class="image__picture block-media__image-picture">
								
			<source
				srcset=https://infinum.com/uploads/2025/12/in-article-databricks-3-novo-1400x753.webp				media='(max-width: 699px)'
				type=image/webp								height="753"
												width="1400"
				 />
								
			<source
				srcset=https://infinum.com/uploads/2025/12/in-article-databricks-3-novo-2400x1291.webp				media='(max-width: 1199px)'
				type=image/webp								height="1291"
												width="2400"
				 />
												<img
					src="https://infinum.com/uploads/2025/12/in-article-databricks-3-novo.webp"
					class="image__img block-media__image-img"
					alt=""
										height="1338"
															width="2488"
										loading="lazy"
					 />
					</picture>

	</figure></div></div>	</div>

<div
	class="wrapper wrapper__use-simple--true"
	data-id="es-184"
	 data-animation='slideFade' data-animation-target='inner-items'>
		
			<div class="block-typography" data-id="es-182">
	<p	class='typography typography--size-16-text-roman js-typography block-typography__typography'
	data-id='es-183'
	>
	A user query is enriched with relevant context from Vector Search, answered by a large language model, and continuously evaluated through Lakehouse Monitoring to ensure data quality, retrieval relevance, and response reliability.</p></div>	</div>

<div
	class="wrapper wrapper__use-simple--true"
	data-id="es-187"
	 data-animation='slideFade' data-animation-target='inner-items'>
		
			<div class="block-typography" data-id="es-185">
	<h3	class='typography typography--size-36-text js-typography block-typography__typography'
	data-id='es-186'
	>
	<strong>A question every AI team should ask </strong></h3></div>	</div>

<div
	class="wrapper wrapper__use-simple--true"
	data-id="es-190"
	 data-animation='slideFade' data-animation-target='inner-items'>
		
			<div class="block-typography" data-id="es-188">
	<p	class='typography typography--size-16-text-roman js-typography block-typography__typography'
	data-id='es-189'
	>
	If your AI workload doubled in size tomorrow, would your current data and governance structures scale with the same confidence as your application layer?</p></div>	</div>

<div
	class="wrapper wrapper__use-simple--true"
	data-id="es-193"
	 data-animation='slideFade' data-animation-target='inner-items'>
		
			<div class="block-typography" data-id="es-191">
	<p	class='typography typography--size-16-text-roman js-typography block-typography__typography'
	data-id='es-192'
	>
	If the answer isn’t a clear yes, it might be time to lay a stronger foundation with Databricks.</p></div>	</div>

<div
	class="wrapper wrapper__use-simple--true"
	data-id="es-195"
	 data-animation='slideFade' data-animation-target='inner-items'>
		
			<div class="block-highlighted-text">
	<p	class='typography typography--size-36-text js-typography block-highlighted-text__typography'
	data-id='es-194'
	>
	<strong><strong>Introducing Databricks into an existing environment is not a platform replacement. It is an architectural enhancement that consolidates governance, data reliability, model lifecycle management, and observability. </strong></strong></p></div>	</div>

<div
	class="wrapper wrapper__use-simple--true"
	data-id="es-198"
	 data-animation='slideFade' data-animation-target='inner-items'>
		
			<div class="block-typography" data-id="es-196">
	<p	class='typography typography--size-16-text-roman js-typography block-typography__typography'
	data-id='es-197'
	>
	The underlying cloud continues to operate application and networking layers, while Databricks provides the durable, governed data foundation needed for long-term AI operations. With <a href="https://6sense.com/tech/big-data-analytics/databricks-market-share" target="_blank" rel="noreferrer noopener">Databricks capturing ~17% of the data warehouse market</a> as of November 2025, its role in enterprise AI infrastructure continues to grow.</p></div>	</div>

<div
	class="wrapper wrapper__use-simple--true"
	data-id="es-201"
	 data-animation='slideFade' data-animation-target='inner-items'>
		
			<div class="block-typography" data-id="es-199">
	<p	class='typography typography--size-16-text-roman js-typography block-typography__typography'
	data-id='es-200'
	>
	If you&#8217;re ready to accelerate your RAG architecture or take the next leap in your AI platform, <a href="https://infinum.com/artificial-intelligence/" id="https://infinum.com/artificial-intelligence/" target="_blank" rel="noreferrer noopener">our team can help you build a modern, scalable foundation designed for long-term success.</a></p></div>	</div>

<div
	class="wrapper wrapper__use-simple--true"
	data-id="es-204"
	 data-animation='slideFade' data-animation-target='inner-items'>
		
			<div class="block-typography" data-id="es-202">
	<p	class='typography typography--size-16-text-roman js-typography block-typography__typography'
	data-id='es-203'
	>
	See how we built a <strong><a href="https://infinum.com/work/midtown-business-intelligence-platform/">real-time data intelligence platform for Midtown Athletic Club</a></strong> as an example of data engineering in practice.</p></div>	</div>
</div>
</div>		</div>
	</div><p>The post <a href="https://infinum.com/blog/scaling-ai-with-databricks/">From RAG to Riches: Strengthening Your Cloud AI Foundation with Databricks</a> appeared first on <a href="https://infinum.com">Infinum</a>.</p>
]]>
				</content:encoded>
			</item>
		
	</channel>
</rss>