Episodes

  • LamRAG : From Data to Constructive Insights using Amazon Bedrock
    Sep 22 2024

    In this episode, Renaldi Gondosubroto and Sandeep Kumar dive into the core concepts of Large Language Models (LLMs), Prompt Engineering, Vector Databases, Retrieval-Augmented Generation (RAG), and Agents. Using Amazon Bedrock's capabilities, they’ll demystify how to leverage AI in real-world scenarios. Highlighting Sandeep's internal application, Feedbackly, they will discuss how it collects sprint feedback and provides actionable insights using advanced AI techniques. Listeners will gain practical insights into integrating AI with AWS services, from sentiment analysis to career progression recommendations, making complex concepts accessible and impactful.

    Show More Show Less
    44 mins
  • Thawing Java on AWS Lambda: Reducing Cold Start Times from 11 Seconds to 1
    Jun 9 2024

    Are your Java cloud functions too slow? What if they could start in milliseconds instead of seconds? And use less memory to boot?

    This is not a dream, but by changing the way we compile our Java functions we can significantly reduce memory usage while also drastically improving startup performance.

    Java has never been a perfect fit for Function as a Service platforms such as AWS Lambda or Azure Functions. While both platforms have official support for Java, Java functions unfortunately suffer from significantly longer cold start times than many other runtimes.

    In this episode we talk through a simple Spring Cloud Java function running on AWS Lambda with fairly horrible cold start times of around 6 seconds and then compare a few different approaches for significantly improving it. Eventually ending up with a cold start time of just 100 milliseconds - making Java a viable, though not without drawbacks, choice for FaaS platforms.

    Show More Show Less
    45 mins
  • Developing Performant and Secure Serverless and Generative AI Solutions with WebAssembly
    May 5 2024

    In this insightful episode, join me and Matt as we explore the transformative role of WebAssembly in enhancing serverless computing and generative AI workloads. We also delve into how WebAssembly is not just boosting performance but also addressing critical aspects of sustainability and security.

    Discover how these technologies are reshaping the landscape of development and deployment, and gain expert insights into the future possibilities they hold. Whether you're a developer, a tech enthusiast, or just curious about the next big thing in tech, this episode is your gateway to understanding the impact and potential of WebAssembly in the modern tech ecosystem.

    Matt Butcher is co-founder and CEO of Fermyon, the serverless WebAssembly in the cloud company. He is one of the original creators of Helm, Brigade, CNAB, OAM, Glide and Krustlet. He has written and co-written many books, including "Learning Helm" and "Go in Practice." He is a co-creator of the "Illustrated Children’s Guide to Kubernetes" series. These days, he works mostly on WebAssembly projects such as Spin, Fermyon Cloud and Bartholomew. He holds a Ph.D. in Philosophy. He lives in Colorado, where he drinks lots of coffee.

    Show More Show Less
    53 mins
  • Building Generative AI on Serverless Architectures within the Healthcare Industry
    Dec 10 2023

    The healthcare industry is a prominent setting where Generative AI has been seen to have created a large impact. In this episode, join me and Luca discussing how he has been using it in building Neosperience's projects, the use cases that it has had that has helped doctors and patients alike, and the techniques that are employed including with agents and Retrieval Augmented Generation (RAG). We will also discuss the regulatory considerations and constraints that come with building in this setting.

    Luca is the CTO and R&D manager at Neosperience and Neosperience Health and an AWS Serverless Hero.

    Show More Show Less
    52 mins
  • Building with Generative AI within Step Functions and Eventually Consistent Architectures
    Nov 26 2023

    In this episode, the spotlight goes on the power of step functions and eventually consistent architectures. Listen to me and Matt talk about how Generative AI workloads can be optimized within these architectures and learn best practices of using them alongside with other features and tools within AWS that best support this.

    Matt has been building web applications since the 1990s. Today his focus is on cloud and serverless. He’s dabbled in writing and speaking, and collects various links and artifacts at https://mattmorgan.cloud/. Matt thinks about failure all the time, but isn’t depressed.

    Show More Show Less
    32 mins
  • Conjuring Creativity with AI and Lambda
    Nov 12 2023

    In this episode, we cast the spotlight on AWS Lambda, the event-driven, serverless computing platform that's becoming the backbone of modern application development. Listen in as me and Girish dissect how Generative AI is becoming an integral part of this landscape, enhancing the agility and scalability of Lambda functions, and some Lambda projects we have worked on utilizing Generative AI. From reducing operational costs to driving innovation, we explore how this combination is not just streamlining deployment but also catalyzing a new era of efficient, intelligent application design.

    Girish Mukim is an AWS Solutions Architect with a remarkable 17-year career in the IT industry. His primary expertise lies in infrastructure projects, particularly in the realm of database management. Girish specializes in assisting clients with cloud transformation, including migrations from on-premises systems to the AWS cloud and cloud-native development using microservices. He is also deeply involved in crafting cloud adoption strategies, designing target architectures, and ensuring cost-effectiveness. Girish collaborates with AWS through the Migration Acceleration Program to provide clients with efficient migration solutions. He is actively engaged within the AWS community and a distinguished participant in the AWS Community Builders program. Girish recently has been accepted as an AWS Ambassador.

    Show More Show Less
    34 mins
  • The Generative AI Impact on CDK and Serverless Infrastructure as Code
    Oct 24 2023

    In our very first episode, join me and Bojan Zivic as we delve deep into the pioneering intersections of Cloud Development Kit (CDK), Serverless technologies, and the groundbreaking capabilities of Generative AI (GenAI). Infrastructure-as-Code (IaC) stands as a testament to tech evolution, and with GenAI entering the picture, we're on the cusp of an IaC revolution.

    Join us as we:

    • Decode the nuances of CDK and how GenAI augments its capabilities.
    • Explore the boundless possibilities presented by Serverless solutions when enhanced by GenAI.
    • Discuss real-world applications and success stories that paint a vivid picture of the future.

    Bojan is an AWS Ambassador and Serverless community builder, with a love for all things CDK and Serverless.

    Show More Show Less
    37 mins