Data Engineering Podcast

513 episodes — Page 2 of 11

Ep 462: Exploring NATS: A Multi-Paradigm Connectivity Layer for Distributed Applications

Summary

In this episode of the Data Engineering Podcast, Derek Collison, creator of NATS and CEO of Synadia, talks about the evolution and capabilities of NATS as a multi-paradigm connectivity layer for distributed applications. Derek discusses the challenges and solutions in building distributed systems, and highlights the unique features of NATS that differentiate it from other messaging systems. He delves into the architectural decisions behind NATS, including its ability to handle high-speed global microservices, support for edge computing, and integration with JetStream for data persistence, and explores the role of NATS in modern data management and its use cases in industries like manufacturing and connected vehicles.

Announcements

Hello and welcome to the Data Engineering Podcast, the show about modern data management.
Data migrations are brutal. They drag on for months, sometimes years, burning through resources and crushing team morale. Datafold's AI-powered Migration Agent changes all that. Their unique combination of AI code translation and automated data validation has helped companies complete migrations up to 10 times faster than manual approaches. And they're so confident in their solution, they'll actually guarantee your timeline in writing. Ready to turn your year-long migration into weeks? Visit dataengineeringpodcast.com/datafold today for the details.
Your host is Tobias Macey and today I'm interviewing Derek Collison about NATS, a multi-paradigm connectivity layer for distributed applications.

Interview

Introduction
How did you get involved in the area of data management?
Can you describe what NATS is and the story behind it?
How have your experiences in past roles (Cloud Foundry, TIBCO messaging systems) informed the core principles of NATS?
What other sources of inspiration have you drawn on in the design and evolution of NATS? (e.g. Kafka, RabbitMQ, etc.)
There are several patterns and abstractions that NATS can support, many of which overlap with other well-regarded technologies. When designing a system or service, what are the heuristics that should be used to determine whether NATS should act as a replacement or addition to those capabilities? (e.g. considerations of scale, speed, ecosystem compatibility, etc.)
There is often a divide in the technologies and architecture used between operational/user-facing applications and data systems. How does the unification of multiple messaging patterns in NATS shift the ways that teams think about the relationship between these use cases?
How does the shared communication layer of NATS with multiple protocol and pattern adapters reduce the need to replicate data and logic across application and data layers?
Can you describe how the core NATS system is architected?
How have the design and goals of NATS evolved since you first started working on it?
In the time since you first began writing NATS (~2012) there have been several evolutionary stages in both application and data implementation patterns. How have those shifts influenced the direction of the NATS project and its ecosystem?
For teams who have an existing architecture, what are some of the patterns for adoption of NATS that allow them to augment or migrate their capabilities?
What are some of the ecosystem investments that you and your team have made to ease the adoption and integration of NATS?
What are the most interesting, innovative, or unexpected ways that you have seen NATS used?
What are the most interesting, unexpected, or challenging lessons that you have learned while working on NATS?
When is NATS the wrong choice?
What do you have planned for the future of NATS?

Contact Info

GitHub
LinkedIn

Parting Question

From your perspective, what is the biggest gap in the tooling or technology for data management today?

Closing Announcements

Thank you for listening! Don't forget to check out our other shows. Podcast.__init__ covers the Python language, its community, and the innovative ways it is being used. The AI Engineering Podcast is your guide to the fast-moving world of building AI systems.
Visit the site to subscribe to the show, sign up for the mailing list, and read the show notes.
If you've learned something or tried out a project from the show then tell us about it! Email [email protected] with your story.

Links

NATS
NATS JetStream
Synadia
Cloud Foundry
TIBCO
Applied Physics Lab - Johns Hopkins University
Cray Supercomputer
RVCM Certified Messaging
TIBCO ZMS
IBM MQ
JMS == Java Message Service
RabbitMQ
MongoDB
NodeJS
Redis
AMQP == Advanced Message Queueing Protocol
Pub/Sub Pattern
Circuit Breaker Pattern
Zero MQ
Akamai
Fastly
CDN == Content Delivery Network
At Most Once
At Least Once
Exactly Once
AWS Kinesis
Memcached
SQS
Segment
Rudderstack
Podcast Episode
DLQ == Dead Letter Queue
MQTT == Message Queueing Telemetry Transport
NATS Kafka Bridge
10BaseT Network
Web Assembly
RedPanda
Podcast Episode
Pulsar Functions
mTLS
AuthZ (Authorization)
AuthN (Authentication)
NATS Auth Callouts
OPA == Open Policy Agent
RAG == Retrieval Augmented Gener

Ep 461: Advanced Lakehouse Management With The LakeKeeper Iceberg REST Catalog

Ep 460: Simplifying Data Pipelines with Durable Execution

Ep 459: Overcoming Redis Limitations: The Dragonfly DB Approach

Ep 458: Bringing AI Into The Inner Loop of Data Engineering With Ascend

Ep 457: Astronomer's Role in the Airflow Ecosystem: A Deep Dive with Pete DeJoy

Ep 456: Accelerated Computing in Modern Data Centers With Datapelago

Ep 455: The Future of Data Engineering: AI, LLMs, and Automation

Ep 454: Evolving Responsibilities in AI Data Management

Ep 453: CSVs Will Never Die And OneSchema Is Counting On It

Ep 452: Breaking Down Data Silos: AI and ML in Master Data Management

Ep 451: Building a Data Vision Board: A Guide to Strategic Planning

Ep 450: How Orchestration Impacts Data Platform Architecture

Ep 449: An Exploration Of The Impediments To Reusable Data Pipelines

Ep 448: The Art of Database Selection and Evolution

Ep 447: Bridging Code and UI in Data Orchestration with Kestra

Ep 446: Streaming Data Into The Lakehouse With Iceberg And Trino At Going

Ep 445: An Opinionated Look At End-to-end Code Only Analytical Workflows With Bruin

Ep 444: Feldera: Bridging Batch and Streaming with Incremental Computation

Ep 443: Accelerate Migration Of Your Data Warehouse with Datafold's AI Powered Migration Agent

Ep 442: Bring Vector Search And Storage To The Data Lake With Lance

Ep 441: The Role of Python in Shaping the Future of Data Platforms with DLT

Ep 440: Build Your Data Transformations Faster And Safer With SDF

Ep 439: Scaling Airbyte: Challenges and Milestones on the Road to 1.0

Ep 438: Enhancing Data Accessibility and Governance with Gravitino

Ep 437: The Evolution of DataOps: Insights from DataKitchen's CEO

Ep 436: Achieving Data Reliability: The Role of Data Contracts in Modern Data Management

Ep 435: How Generative AI Is Impacting Data Engineering Teams

Ep 434: The Role of Product Managers in Data-Centric Organizations

Ep 433: Neon: A Serverless And Developer Friendly Postgres

Ep 432: Improve Data Quality Through Engineering Rigor And Business Engagement With Synq

Ep 431: Stitching Together Enterprise Analytics With Microsoft Fabric

Ep 430: Being Data Driven At Stripe With Trino And Iceberg

Ep 429: X-Ray Vision For Your Flink Stream Processing With Datorios

Ep 428: Practical First Steps In Data Governance For Long Term Success

Ep 427: Data Migration Strategies For Large Scale Systems

Ep 426: Zenlytic Is Building You A Better Coworker With AI Agents

Ep 425: Release Management For Data Platform Services And Logic

Ep 424: Barking Up The Wrong GPTree: Building Better AI With A Cognitive Approach

Ep 423: Build Your Second Brain One Piece At A Time

Ep 422: Making Email Better With AI At Shortwave

Ep 421: Designing A Non-Relational Database Engine

Ep 420: Establish A Single Source Of Truth For Your Data Consumers With A Semantic Layer

Ep 419: Adding Anomaly Detection And Observability To Your dbt Projects Is Elementary

Ep 418: Ship Smarter Not Harder With Declarative And Collaborative Data Orchestration On Dagster+

Ep 417: Reconciling The Data In Your Databases With Datafold

Ep 416: Version Your Data Lakehouse Like Your Software With Nessie

Ep 415: When And How To Conduct An AI Program

Ep 414: Find Out About The Technology Behind The Latest PFAD In Analytical Database Development

Ep 413: Using Trino And Iceberg As The Foundation Of Your Data Lakehouse