Abhishek's blog

PostgreSQL pgoutput plugin for change data capture

Posted on August 13, 2020 | 5 minutes |

Set up a Change Data Capture architecture on Azure using Debezium, Postgres and Kafka was a tutorial on how to use Debezium for change data capture from Azure PostgreSQL and send them to Azure Event Hubs for Kafka - it used the wal2json output plugin.

What about the `pgoutput` plugin?

This blog will provide a quick walk through of how to pgoutput plugin and provide clarification on this point raised by Denis Arnaud (thank you for brining it up!)

[Read More]

postgresql kafka azure

How to use Azure Go SDK to manage Azure Data Explorer clusters

Posted on July 30, 2020 | 8 minutes |

Getting started with Azure Data Explorer using the Go SDK covered how to use the Azure Data Explorer Go SDK to ingest and query data from azure data explorer to ingest and query data. In this blog you will the Azure Go SDK to manage Azure Data Explorer clusters and databases.

Azure Data Explorer (also known as Kusto) is a fast and scalable data exploration service for analyzing large volumes of diverse data from any data source, such as websites, applications, IoT devices, and more. This data can then be used for diagnostics, monitoring, reporting, machine learning, and additional analytics capabilities.

[Read More]

golang azure data explorer big data nosql azure

Kafka on Kubernetes, the Strimzi way! (Part 4)

Posted on July 28, 2020 | 10 minutes |

Welcome to part four of this blog series! So far, we have a Kafka single-node cluster with TLS encryption on top of which we configured different authentication modes (TLS and SASL SCRAM-SHA-512), defined users with the User Operator, connected to the cluster using CLI and Go clients and saw how easy it is to manage Kafka topics with the Topic Operator. So far, our cluster used ephemeral persistence, which in the case of a single-node cluster, means that we will lose data if the Kafka or Zookeeper nodes (Pods) are restarted due to any reason.

[Read More]

kafka kubernetes strimzi

Tutorial: Getting started with Azure Data Explorer using the Go SDK

Posted on July 22, 2020 | 9 minutes |

With the help of an example, this blog post will walk you through how to use the Azure Data explorer Go SDK to ingest data from a Azure Blob storage container and query it programmatically using the SDK. After a quick overview of how to setup Azure Data Explorer cluster (and a database), we will explore the code to understand what’s going on (and how) and finally test the application using a simple CLI interface

[Read More]

golang azure data explorer big data nosql azure

Orchestrate Azure Event Hubs via Kubernetes

Posted on July 13, 2020 | 7 minutes |

Azure Service Operator is an open source project to help you provision and manage Azure services using Kubernetes. Developers can use it to provision Azure services from any environment, be it Azure, any other cloud provider or on-premises - Kubernetes is the only common denominator!

It can also be included as a part of CI/CD pipelines to create, use and tear down Azure resources on-demand. Behind the scenes, all the heavy lifting is taken care of by a combination of Custom Resource Definitions which define Azure resources and the corresponding Kubernetes Operator(s) which ensure that the state defined by the Custom Resource Definition is reflected in Azure as well.

[Read More]

azure event hubs kubernetes

Kafka on Kubernetes, the Strimzi way! (Part 3)

Posted on July 7, 2020 | 11 minutes |

Over the course of the first two parts of this blog series, we setup a single-node Kafka cluster on Kubernetes, secured it using TLS encryption and accessed the broker using both internal and external clients. Let’s keep iterating! In this post, we will continue the Kafka on Kubernetes journey with Strimzi and cover:

How to apply different authentication types: TLS and SASL SCRAM-SHA-512
Use Strimzi Entity operator to manage Kafka users and topics
How to configure Kafka CLI and Go client applications to securely connect to the Kafka cluster

The code is available on GitHub - https://github.com/abhirockzz/kafka-kubernetes-strimzi/
[Read More]

kafka kubernetes strimzi

Change Data Capture architecture using Debezium, Postgres and Kafka

Posted on July 2, 2020 | 9 minutes |

Change Data Capture (CDC) is a technique used to track row-level changes in database tables in response to create, update and delete operations. Different databases use different techniques to expose these change data events - for example, logical decoding in PostgreSQL, MySQL binary log (binlog) etc. This is a powerful capability, but useful only if there is a way to tap into these event logs and make it available to other services which depend on that information.

[Read More]

postgresql kafka azure debezium

Azure Event Hubs 'Role Based Access Control' in action

Posted on June 24, 2020 | 7 minutes |

Azure Event Hubs is streaming platform and event ingestion service that can receive and process millions of events per second. In this blog, we are going to cover one of the security aspects related to Azure Event Hubs.

Shared Access Signature (SAS) is a commonly used authentication mechanism for Azure Event Hubs which can be used to enforce granular control over the type of access you want to grant - it works by configuring rules on Event Hubs resources (namespace or topic). However, it is recommended that you use Azure AD credentials (over SAS) whenever possible since it provides similar capabilities without the need to manage SAS tokens or worry about revoking a compromised SAS.

[Read More]

azure event hubs security rbac azure

Kafka on Kubernetes, the Strimzi way! (Part 2)

Posted on June 17, 2020 | 7 minutes |

We kicked off the the first part of the series by setting up a single node Kafka cluster which was accessible to only internal clients within the same Kubernetes cluster, had no encryption, authentication or authorization and used temporary persistence. We will keep iterating/improving on this during the course of this blog series.

This part will cover these topics:

Expose Kafka cluster to external applications
Apply TLS encryption
Explore Kubernetes resources behind the scenes
Use Kafka CLI and Go client applications to test our cluster setup

The code is available on GitHub - https://github.com/abhirockzz/kafka-kubernetes-strimzi/
[Read More]

kafka kubernetes strimzi

Kafka on Kubernetes, the Strimzi way! (Part 1)

Posted on June 8, 2020 | 7 minutes |

Some of my previous blog posts (such as Kafka Connect on Kubernetes, the easy way!), demonstrate how to use Kafka Connect in a Kubernetes-native way. This is the first in a series of blog posts which will cover Apache Kafka on Kubernetes using the Strimzi Operator. In this post, we will start off with the simplest possible setup i.e. a single node Kafka (and Zookeeper) cluster and learn:

Strimzi overview and setup
Kafka cluster installation
Kubernetes resources used/created behind the scenes
Test the Kafka setup using clients within the Kubernetes cluster

The code is available on GitHub - https://github.com/abhirockzz/kafka-kubernetes-strimzi
[Read More]

kafka kubernetes strimzi

PostgreSQL pgoutput plugin for change data capture

What about the pgoutput plugin?

How to use Azure Go SDK to manage Azure Data Explorer clusters

Kafka on Kubernetes, the Strimzi way! (Part 4)

Tutorial: Getting started with Azure Data Explorer using the Go SDK

Orchestrate Azure Event Hubs via Kubernetes

Kafka on Kubernetes, the Strimzi way! (Part 3)

Change Data Capture architecture using Debezium, Postgres and Kafka

Azure Event Hubs 'Role Based Access Control' in action

Kafka on Kubernetes, the Strimzi way! (Part 2)

Kafka on Kubernetes, the Strimzi way! (Part 1)

What about the `pgoutput` plugin?