Real-Time Oracle Database to Google BigQuery: Powering AI-Driven Analytics with Oracle GoldenGate 23ai
Modern AI and machine learning initiatives demand fresh data to deliver accurate predictions and...

Artificial Intelligence (AI) is buzzing in today’s enterprise circles. Let’s briefly examine using AI in data integration, specifically with Oracle GoldenGate. First, what are the benefits of AI in data integration?
Using AI in data integration offers several significant benefits, enhancing data management processes’ efficiency, accuracy, and overall capabilities. Here are some of the key advantages:
By leveraging AI in data integration, organizations can achieve more reliable, efficient, and scalable data management, ultimately driving better business outcomes and gaining a competitive edge.
These eleven benefits are all great in concept, but how do you put these into practice?
Oracle GoldenGate has always been the interface for successful data integration for many organizations. With the recent rise in AI, many organizations seek solutions to build robust AI infrastructure with minimal downtime and maximum value.
In reviewing all the data integration platforms out there – Oracle GoldenGate, FiveTran, Qlik, Airbyte, Striim, and many others – Oracle GoldenGate has the most robust and stable approach to tackling the new age of AI, although it is geared mainly towards Oracle and PostgreSQL workloads currently.
With the release of Oracle GoldenGate 23ai, many are excited about getting to see AI at work; however, the 23ai in Oracle GoldenGate 23ai only means that Oracle GoldenGate can replicate the new datatype within Oracle Database 23ai – Vector datatype. The core replication concepts that Oracle GoldenGate follows are still the bedrock of replication.
Since we touched on the vector datatype, let’s look at what vectors are and their benefits.
Vectors are nothing new within the IT industry; after all, search engines have used different vectors for decades. The vectors being introduced now are high-dimensional numerical representations of data items. These vectors capture the semantic meaning of the data, enabling machines to understand and process the information effectively.
The primary purpose of vectors is to use them within Retrieval Augmentation Generation (RAG) to facilitate efficient and accurate retrieval of relevant information from a large corpus of structured and unstructured data.
There are four steps to using a vector with Retrieval Augmentation Generation (RAG). The basic concepts are:
Now, with a brief understanding of what vectors are, how they are used, and their benefits, how does this apply to Oracle GoldenGate 23ai?
Reviewing a simple uni-directional use case, like populating a data warehouse running Oracle Database 23ai (cloud or on-premises) in the manufacturing vertical.
Data is captured from data marts located at remote sites and applied to the data warehouse at a central site. The steps would look like:
The six steps in the previous section are based on general replication principles defined in the CAP theorem. Under CAP, a data replication system ensures consistency (among replicated copies), availability (of the system for read/write operations), and partition tolerance (in the face of the nodes in the system being partitioned by a network fault).
When overlaying AI onto these replication steps, it is nothing more than ensuring that vectors, the data that drive an RAG system, are available where needed for usage within an RAG system. In Oracle GoldenGate 23ai, Oracle has ensured that vectors can be captured, transferred, and applied to data platforms that support the vector datatype—enabling organizations to bring their real-time unstructured data to their AI applications and delivering the next-generation user experiences.
Oracle GoldenGate 23ai has added to its extensive heterogeneous platforms by including popular vector databases in its growing portfolio. The new additions to the portfolio are:
Now that Oracle GoldenGate 23ai supports so many different Oracle and non-Oracle vector platforms, getting your mission-critical unstructured data to where it is needed is possible. The data integration patterns below can be used immediately after upgrading to Oracle GoldenGate 23ai.
Migration of vectors to Oracle Database 23ai vector database
Multi-Master/Multi-Cloud/Active-Active database replication

Consolidation of vector changes
With the ability to now take an organization’s real-time unstructured data, represented as vectors, an organization can move structured and unstructured data across the enterprise.
As Artificial Intelligence (AI) starts to take hold in organizations, the typical use case that will leverage Oracle GoldenGate 23ai will be private Retrieval Augmented Generation (RAG) systems. These private RAG systems will leverage Oracle GoldenGate 23ai by moving structured and unstructured data to a central data hub where vectors will be leveraged against private large language models.
An illustration of the architecture this would follow is:

In this illustration, you can see that Oracle GoldenGate 23ai is moving data from various database sources to the data warehouse in real-time. Don’t let the illustration fool you; the sources that Oracle GoldenGate 23ai can capture include more than databases now. New capture types that can be supported include ERP & SaaS applications, Vector stores, Event Messaging, and NoSQL.
Although Oracle GoldenGate 23ai makes it easy to move structured and unstructured data in real-time, including vectors, there is an inherent problem here. This problem is common with all vector databases – what embedding model is used?
As AI begins to grow and be used more in enterprises, ensuring that the correct embedding model to embed data is used will be critical. Because we can now replicate vectors and enable Retrieval Augmented Generation (RAG) platforms with fresh and relevant data, it doesn’t mean that data sources can have different embeddings. What needs to be understood and combed are:
These are only some of the questions that should be asked or evaluated before building a Retrieval Augmented Generation (RAG) system or implementing real-time replication of vectors with Oracle GoldenGate 23ai.
In closing, Oracle GoldenGate 23ai is a leap forward in ensuring that enterprises can quickly and dynamically build Retrieval Augmented Generation (RAG) systems for their private and secure use cases.
Modern AI and machine learning initiatives demand fresh data to deliver accurate predictions and...
One of the biggest issues with Oracle GoldenGate over the years has been the database permissions...
Oracle GoldenGate has long been the go-to solution for real-time data integration, and with the...