PLAY PODCASTS
The AI & Tech Society by Danar

The AI & Tech Society by Danar

110 episodes — Page 1 of 3

Gemma 4: Google's Open-Source LLM Competing with Chinese Models

May 14, 202618 min

Musk vs. Altman: The OpenAI Legal Battle Explained

May 10, 202619 min

AI cut 16,000 U.S. jobs a month — what the Goldman Sachs report actually says

May 6, 202618 min

Claude Mythos: The Model Anthropic Chose Not to Release

May 4, 202619 min

OpenAI's GPT-5.5: AI Agents Just Went Pro

Apr 26, 202619 min

Claude Opus 4.7: The Quiet Upgrade

Apr 22, 202617 min

US vs. China: The AI Race Is Closer Than You Think 2026

Apr 18, 202621 min

KPIs are Dead: The New Metric AI Companies are Using Instead in 2026

Apr 15, 202619 min

S4 Ep 18OpenAI’s Bold 7-Point Industrial Policy for the AI Age

Five Strategic TakeawaysDocument signals regulatory direction on access, taxation, worker protections, safetyFour-day week changes conversation about who benefits from AI efficiencyWorker voice emerging as both ethical imperative and operational best practiceFrontier AI compliance requirements are comingRead with both charity and skepticismThe Test of SincerityWatch for:Does OpenAI implement four-day week internally?Do they accept monitoring that constrains their development?Do they modify proposals based on criticism?Do they advocate for policies against their commercial interest? Hosted on Acast. See acast.com/privacy for more information.

Apr 10, 202622 min

S4 Ep 17 The Anthropic Leak and What it Reveals About AI's Future

10-Component Prompt ArchitectureTask context (role/persona)Tone context (register)Background data (docs, code, guides)Detailed task description and rulesExamples (1-2 ideal outputs)Conversation historyImmediate task descriptionThink step-by-step instructionsOutput formattingPrefilled response (advanced)Strategic ImplicationsFor Developers:AI tools have more access than most employeesLeaked prompting framework is freely adoptableTreat "leaked code" repos as malwareFor Tech Leaders:Demand transparency on internal vs external differencesBuild dark code governance before incidentsApply vendor security assessment to AI toolsFor AI Strategy:Moat is model + trust, not harnessArchitecture secrecy is weak advantagePartial transparency worse than full transparency Hosted on Acast. See acast.com/privacy for more information.

Apr 6, 202625 min

S4 Ep 16AI News Roundup March 2026: GPT-5.4, Nvidia GTC, EU AI Act & Top Startups

Your complete AI news roundup for March 2026 — covering GPT-5.4’s human-surpassing benchmark performance, Nvidia’s Rubin GPU reveal at GTC 2026, OpenAI’s $110B funding round, DeepSeek V4’s open-source launch, and the EU AI Act’s approaching August enforcement deadline. Includes the latest in AI robotics, healthcare breakthroughs, Swedish AI policy, startup investments, chip hardware updates, and consumer adoption trends. Essential reading for AI leaders, developers, and business decision-makers staying ahead of the fast-moving artificial intelligence landscape.Seven Key TakeawaysAI is simultaneously superhuman and subhuman by taskFunding concentration is extreme (83% to top 3)Consumer sentiment matters (QuitGPT forced contract changes)Open source catching up faster than expectedSovereign AI infrastructure acceleratingAgentic AI has moved to productionSkills premium is real but treadmill accelerating Hosted on Acast. See acast.com/privacy for more information.

Apr 2, 202624 min

S4 Ep 15Claude Code: How Anthropic is using Claude Code

Claude Code: How Anthropic is using Claude CodeKey Quotes from Anthropic LeadersBoris Cherny, Head of Claude Code:"I think by the end of the year, everyone is going to be a product manager, and everyone codes. The title software engineer is going to start to go away. It's just going to be replaced by 'builder,' and it's going to be painful for a lot of people.""I think at this point it's safe to say that coding is largely solved.""I have not edited a single line by hand since November."Dario Amodei, CEO:"I think we will be there in three to six months, where AI is writing 90% of the code. And then, in 12 months, we may be in a world where AI is writing essentially all of the code."Jack Clark, Co-founder:"Something that we found is that the value of more senior people with really, really well-calibrated intuitions and taste is going up."The Eight Best PracticesInvest in CLAUDE.md documentation — Configuration files Claude reads at startupClassify tasks: async vs synchronous — Know what to supervise vs delegateCreate self-sufficient verification loops — Tests before code, auto-run builds/lintsStart from clean git state — Checkpoint commits enable safe experimentationUse MCP servers for sensitive data — Better logging and access controlBuild multi-instance parallel workflows — Multiple Claude instances across reposUse screenshots and multimodal input — Figma, dashboards, UI imagesPrompt for simplicity — Interrupt and ask "Try something simpler"The AI PM Cert visit: https://aipmcert.com/ Hosted on Acast. See acast.com/privacy for more information.

Mar 29, 202626 min

S4 Ep 14What People Actually Want from AI

Episode: What 81,000 People Want From AI: The Most Human AI Report So FarStudy: Anthropic Global AI Survey (December 2025)80,508 Claude users interviewed159 countries70 languagesAI-conducted open-ended conversationsPrimary Aspirations (What People Want)CategoryPercentageProfessional Excellence18.8%Personal Transformation13.7%Life Management13.5%Time Freedom11.1%Financial Independence9.7%Key insight: Productivity is often the surface story. When asked what productivity enables, people reveal deeper wants: family time, mental health, meaningful work, paths out of precarity. Hosted on Acast. See acast.com/privacy for more information.

Mar 22, 202627 min

S4 Ep 13AI Politics in 2026: Pentagon AI Military

The Core DisputePentagon Position:Requires "all lawful use" provisions from AI vendorsWants flexibility for future applicationsFocused on Golden Dome, drone swarms, autonomous systemsAnthropic Position:Two non-negotiables: no mass surveillance of Americans, no fully autonomous weaponsWill not sign contracts creating legal pathways to prohibited usesChallenging supply chain risk designation in courtOpenAI Position:Explicit contractual prohibitions on mass surveillance, autonomous weapons, high-stakes automated decisionsCloud-only deployments with OpenAI personnel in loopMaintains control over safety stackWhat the Military Wants AI ForCurrent Uses:Intelligence analysisCyber operationsOperational planningThreat assessmentModeling and simulationClassified environment support Hosted on Acast. See acast.com/privacy for more information.

Mar 16, 202619 min

S4 Ep 12AI and Jobs in 2026

Episode: AI and Jobs in 2026: What Anthropic's Labor Report Really Means for Workers, Policy, and BusinessReport: Anthropic Economic Index Labor Market Analysis (March 5, 2026)The Headline FindingNo mass displacement yet, but entry is getting harder:No systematic increase in unemployment for AI-exposed occupationsJob-finding rates for workers aged 22-25 in exposed fields: down ~14% vs 2022Unemployment rates: flatFirst visible effect: fewer young people getting their first footholdObserved Exposure: The New MeasureComponentWhat It MeasuresTheoretical Capability% of tasks LLMs could theoretically performObserved UsageWhat people actually do with Claude at workObserved ExposureCombined measure weighted toward automated/work-related usesWhy it matters: Labor markets are shaped by adoption, workflow design, regulation, and trust—not just model demos. Hosted on Acast. See acast.com/privacy for more information.

Mar 8, 202619 min

S4 Ep 11AI News: ChatGPT Ads, Superbowl, Pentagon AI and Seedance 2.0

Coding Model Releases (Feb 12)All three dropped same day:OpenAI: GPT-5.3-Codex-Spark (purpose-built for engineering workflows)Google: Gemini 3 Deep ThinkAnthropic: Major funding round announcementThree-way battle for developer mindshare officially a sprintPentagon AI StrategyFramework: Five "Priority Sprint Projects"Initiatives:GenAI.mil for all-classification AI accessEnterprise agents playbookMandate: All military departments must identify 3+ priority AI projects within 30 daysLanguage:"Any lawful use" in procurement"Military AI dominance" framingDisney vs. ByteDanceAction: Cease-and-desist letters (Feb 14)Target: Seedance 2.0 video generationAccusation: Generating copyrighted characters (Star Wars, Marvel)MPA Statement: "Unauthorized use of U.S. copyrighted works on a massive scale"Implication: AI copyright fight moves from theoretical to legalHBR Productivity StudySource: UC Berkeley study in Harvard Business ReviewFinding: AI users worked faster, took on more tasks, worked longer hours—often without being askedImplication: AI isn't reducing workload—it's intensifying itRecommendation: Managers must design for outcomes, not just outputChinese AI DevelopmentsReleases (mid-February):DeepSeek V4: 1 trillion parameters, coding-focusedAlibaba Qwen 3.5ByteDance Doubao upgradeCost Advantage (RAND): Chinese models run at 1/6 to 1/4 cost of comparable U.S. systemsMarket Share: DeepSeek holds ~89% among AI users in ChinaSpotify Engineering TransformationAnnouncement (Feb 12): Top developers haven't manually written code since DecemberTools:Claude CodeInternal system "Honk"Shift: Engineers are now "full-time AI orchestrators"Implication: Future of engineering is operational, not hypotheticalKey TakeawaysCommercialization-safety tension is real — Ads + safety team dissolution not coincidentalBrand positioning matters — 11% user bump from values messagingCoding model wars intensifying — Three releases same dayGovernment AI accelerating — 30-day Pentagon mandateCopyright enforcement getting real — Disney vs. ByteDanceAI may increase workload — Design for outcomes, prevent burnoutCompanies MentionedOpenAI, Anthropic, Google, Disney, Paramount, ByteDance, Spotify, DeepSeek, Alibaba, Motion Picture Association, Department of DefensePeople MentionedSam Altman (OpenAI CEO)Joshua Achiam (OpenAI, now "chief futurist")Studies ReferencedUC Berkeley/HBR: AI and workload intensificationBNP Paribas: Super Bowl ad effectivenessRAND: Chinese AI cost analysis Hosted on Acast. See acast.com/privacy for more information.

Feb 21, 202617 min

S4 Ep 10AGI: Sam Altman , Dario Amodei & Demis Hassabis Vision

Today we're doing something different. Instead of covering the news cycle, we're going deep on the three people who will likely shape how AGI arrives: Sam Altman of OpenAI, Dario Amodei of Anthropic, and Demis Hassabis of Google DeepMind.Each has a distinct philosophy about how to build transformative AI, what the risks are, and what happens to society when we get there. Understanding these differences isn't academic. These philosophies are determining the products we use, the policies being debated, and potentially the trajectory of human civilization. Hosted on Acast. See acast.com/privacy for more information.

Feb 15, 202622 min

S4 Ep 9AI News February 1-8, 2026: The $650 Billion AI Arms Race Explodes

Key TakeawaysModel race is now platform race (Cowork vs Frontier)$650B Big Tech capex is the new realityProfessional software under genuine threatHardware competition intensifying (AMD, Broadcom)Regulatory complexity growing (federal vs state)AI adoption mainstream but returns concentratedSuper Bowl ads signal consumer battlegroundCompanies MentionedAnthropic, OpenAI, Alphabet/Google, Amazon, Meta, Microsoft, NVIDIA, AMD, Broadcom, Thomson Reuters, LegalZoom, HP, Intuit, Oracle, State Farm, Uber, Cisco, BBVA, T-Mobile, Cerebras, Goodfire, Bedrock Robotics, Sana, Perplexity, Boston Dynamics, Caterpillar, Khan Academy Hosted on Acast. See acast.com/privacy for more information.

Feb 10, 202625 min

S4 Ep 8AI News January 26-30, 2026: The Verticalization Era Begins

Major Launches This WeekProductCompanyDomainKey FeaturePrismOpenAIScienceGPT-5.2 with 400K token context for researchGOV.UK AssistantAnthropicGovernmentAgentic employment support for UKPersonal IntelligenceGoogleConsumerGmail + Photos integration in AI ModeAI Overviews upgradeGoogleSearchGemini 3 default, follow-up questionsOpenAI Prism DetailsModel: GPT-5.2400,000-token context window (~800 pages)Fine-tuned for mathematical and scientific reasoningNative LaTeX understandingVisual Synthesis for diagrams to codePricing:Personal: Free (unlimited projects/collaborators)Education: Institutional tier (TBD)Enterprise: Compliance features (TBD)Built on: Acquired startup Crixet (LaTeX platform)Competition:Overleaf (LaTeX collaboration)Mendeley/Zotero (reference management)Google Scholar integration (anticipated)TRAIN Act SummaryName: Transparency and Responsibility for Artificial Intelligence Networks ActSponsors: Rep. Madeleine Dean (D-PA), Rep. Nathaniel Moran (R-TX)Key Provisions:Administrative subpoena for training data disclosure"Subjective good faith belief" standard for requestsNon-compliance creates "rebuttable presumption of copying"Impact: Gives copyright holders discovery rights previously unavailableHardware & InfrastructureASML Q4 2025:Orders: €13.2B ($15.8B) — record quarterAnalyst forecast: €6.85B (far exceeded)Q4 sales: €9.72BFull 2025 sales: €32.7BStock surge: ~6%Intel: Activated ASML EXE:5200 High-NA EUV systemReduces manufacturing steps: 40 → 10Spending Forecasts (Gartner):2026: $2.53 trillion2027: $3.33 trillion Hosted on Acast. See acast.com/privacy for more information.

Feb 6, 202626 min

S4 Ep 7Davos WEF 2026: Elon Musk, Satya Nadella & AI’s Tsunami – 7 Brutal Truths for Leaders

In this episode, we break down the most uncomfortable AI truths that surfaced at Davos WEF 2026 – from IMF chief Kristalina Georgieva calling AI a “tsunami” for the global job market to Anthropic CEO Dario Amodei warning that 50% of white-collar jobs could be disrupted within five years. We unpack Elon Musk’s claim that energy, not algorithms, is now the real AI bottleneck, and NVIDIA’s Jensen Huang framing AI as the largest infrastructure build-out in human history. You’ll hear how McKinsey’s new agentic AI narrative ties into a projected 2.9 trillion dollars in value, why OpenAI, Microsoft, and Google are racing for the AI interface layer, and what Demis Hassabis, Satya Nadella, and Yuval Noah Harari really signaled about AGI, open vs closed models, and “everything made of words” being eaten by AI. Perfect for founders, executives, and policymakers who want the real story behind Davos 2026 and what it means for jobs, power, and leadership in the AI era. Hosted on Acast. See acast.com/privacy for more information.

Feb 4, 202618 min

S4 Ep 6AI News January 19-23 2026: Weekly AI News

Davos HeadlinesKristalina GeorgievaIMF40% of jobs "touched by AI," labor market "tsunami"Dario AmodeiAnthropicAI will replace all developers in 1 year, Nobel-level science in 2 yearsDemis HassabisGoogle DeepMindAGI has 50% chance within decade, current systems "nowhere near" human-levelSatya NadellaMicrosoftEnergy costs, not model quality, will determine AI winnersDonald TrumpU.S. President300GW nuclear capacity target by 2030Jensen HuangNVIDIARobotics is "once-in-lifetime opportunity" for EuropeYuval Noah HarariPhilosopherWarning about AI manipulation potentialBig Tech NewsOpenAI Consumer DeviceConfirmed for H2 2026 launchLikely screenless, possibly wearableHealthcare applications following Torch Health acquisitionThinking Machines CrisisThree co-founders left: Barret Zoph, Luke Metz, Sam SchoenholzAll returned to OpenAIMurati fired Zoph citing "unethical conduct"$12B startup facing existential talent crisisLightricks LTX-2 LaunchAudio-to-video generation with ElevenLabs partnershipAudio becomes control layer for video generation4K/50fps output capabilityOpen-source with complete training codeAPI access launched January 27Energy and InfrastructureData center power: 55GW current → 84GW by 2028 (Goldman Sachs)Trump targeting 300GW new nuclear by 2030Nadella: "Tokens are new commodity"Energy costs = AI competitive advantageLabor Market DataSourceFindingIMF40% of jobs touched by AIBCG AI Radar50% of CEOs believe jobs depend on AI successMcKinseyTwo-thirds haven't scaled AI enterprise-wideMicrosoft ResearchFinance, legal, software engineering most exposedGartner$1.5 trillion invested in AI in 2025Product LaunchesProductCompanyCapabilityLTX-2 Audio-to-VideoLightricks + ElevenLabsGenerate video from audio controlChat with PDFAdobe AcrobatPrompt-based PDF editing and summariesReal Talk + VideoMicrosoft CopilotHuman-like conversation, video generationClaude in ChromeAnthropicBrowser extension for in-page AI assistanceAuth0 for AI AgentsAuth0Identity management for AI agentsScroll AIScrollKnowledge base to AI expert conversionLTX-2 Technical HighlightsArchitecture:14B parameter video stream5B parameter audio streamCross-attention for synchronizationSingle diffusion pass generationCapabilities:4K/50fps output10-20 second clipsLoRA fine-tuning in under 1 hourAudio-first control paradigmOpen Source:Complete training code releasedArchitecture documentationCommunity fine-tuning enabledKey Takeaways for LeadersWorkforce transformation is urgent (40% jobs affected)Energy strategy is AI strategy (tokens as commodity)AI startup ecosystem is volatile (talent wars)New form factors coming (OpenAI device H2 2026)Audio-video generation is production-ready (LTX-2)ROI pressure is real (2/3 failing to scale)AI safety includes manipulation risks (Harari warning)People MentionedKristalina Georgieva (IMF Managing Director)Dario Amodei (Anthropic CEO)Demis Hassabis (Google DeepMind CEO)Satya Nadella (Microsoft CEO)Jensen Huang (NVIDIA CEO)Donald Trump (U.S. President)Mira Murati (Thinking Machines CEO, former OpenAI CTO)Chris Lehane (OpenAI Chief Global Affairs Officer)Yuval Noah Harari (Philosopher)Barret Zoph, Luke Metz, Sam Schoenholz (former Thinking Machines)Companies MentionedOpenAI, Anthropic, Google DeepMind, Microsoft, NVIDIA, Meta, Thinking Machines, Lightricks, ElevenLabs, Adobe, Auth0, Scroll AI, DeepSeek, Moonshot AI, Runway, Pika Hosted on Acast. See acast.com/privacy for more information.

Jan 26, 202624 min

S4 Ep 5AI News January 12-19 2026: ChatGPT Health & Math Breakthrough

Weekly AI news Jan 12-19 2026 covers explosive healthcare wars with ChatGPT Health, Claude Healthcare, AI cracking unsolved math, $25B Anthropic raise, ChatGPT ads. Hosted on Acast. See acast.com/privacy for more information.

Jan 21, 202634 min

S4 Ep 4Agentic Commerce 2026

Agentic commerce is revolutionizing retail with AI shopping agents. Learn how UCP, A2A, AP2, and MCP protocols enable autonomous AI purchasing in 2026. Hosted on Acast. See acast.com/privacy for more information.

Jan 16, 202627 min

S4 Ep 3AI Agents 2026: 7 Powerful Shifts That Will Redefine Work

AI Agents 2026: 7 Powerful Shifts That Will Redefine WorkAgent Competitive LandscapeModel Layer:OpenAI (ChatGPT Agent)Anthropic (Claude, computer use)Google (Gemini, agent frameworks)Infrastructure Layer:LangChainCrewAIAutoGenEnterprise Layer:Microsoft (Copilot, Agent 365)Salesforce (Agent platform)ServiceNowKey Risk CategoriesRisk TypeDescriptionReliabilityAgent errors, hallucinations in productionSecuritySystem access creates attack surfacesComplianceAgent actions may not meet regulationsControlMulti-agent systems behave unexpectedlyWorkforceHuman transition not managed wellKey Statistics ReferencedSalesforce: 50% of support via AI agentsSalesforce: Support staff reduced 9,000 → 5,000KPMG: 33% of organizations deployed agents (3x increase)Gartner: 33% of enterprise software will include agentic AI by 2028Gartner: 15% of daily work decisions made autonomously by 2028CTO Takeaways SummaryShift 1: Design agent roles, ownership, and KPIs like human teammatesShift 2: Architecture beats hype. Map value chain and build agent workflowsShift 3: Ask what 50% agent-handled interactions would mean for youShift 4: Build escalation architecture with explicit human decision pointsShift 5: Agent memory architecture is strategic. Own your context dataShift 6: Build on standard protocols (MCP). Design for multi-agent futureShift 7: Model economics at varying agent augmentation levelsResourcesGoogle AI Agent Trends 2026 ReportModel Context Protocol (MCP) DocumentationAgent Orchestration Conceptual Guide Hosted on Acast. See acast.com/privacy for more information.

Jan 12, 202621 min

S4 Ep 2What Happened in AI and Tech December 2025: Key Stories for Leaders

Key Takeaways for LeadersModel race is genuinely competitive (no single winner)Agentic AI becoming infrastructureCosts collapsing across providersRegulatory complexity increasingWorkforce impacts real and acceleratingInfrastructure investments unprecedentedAI creativity may be more limited than assumedCompanies MentionedMeta, Amazon, Apple, Google, OpenAI, Anthropic, NVIDIA, DeepSeek, Mistral, AMD, Microsoft, Salesforce, ByteDance, Alibaba, EQT, 1X Technologies, Unitree, Fal, Lovable, Databricks, Unconventional AI, SAP, Volvo, Stena Line, ManusPeople MentionedNaveen Rao (Unconventional AI)John Giannandrea (Former Apple AI Chief)Marty Makary (FDA Commissioner)John Carreyrou (Pulitzer winner, copyright plaintiff)Yann LeCun (Meta, world model startup)Ron DeSantis (Florida Governor)Donald Trump (US President) Hosted on Acast. See acast.com/privacy for more information.

Jan 7, 202622 min

S4 Ep 1Sam Altman AI 2026: ChatGPT-6 and OpenAI Predictions

Sam Altman AI 2026: Inside OpenAI's Code Red Battle with Google and the GPT-6 RoadmapSam Altman AI 2026 predictions have sent shockwaves through Silicon Valley. In this episode of The AI and Tech Society, we break down the most important insights from Alex Kantrowitz's December 2025 interview with the OpenAI CEO.What You'll Learn:OpenAI declared "code red" after Google Gemini 3 outperformed ChatGPT on major benchmarks. Sundar Pichai's team now has 650 million monthly Gemini users and distribution across Gmail, YouTube, Chrome, and Android. Sam Altman warned employees of "rough vibes" and delayed advertising, shopping agents, and healthcare AI to focus entirely on ChatGPT quality.The Sam Altman AI 2026 roadmap reveals GPT-6 class models arriving Q1 2026 with step-change reasoning capabilities. But the bigger surprise is what consumers actually want. Altman says users no longer need more IQ from AI. They want faster responses, better experiences, and seamless integration.We cover all 8 predictions including the shift from chatbots to agentic operating systems, enterprise hunger for deep reasoning, AI systems that discover novel scientific insights, and the March 2028 target for automated AI researchers.Episode Highlights:The Code Red declaration and what it means for OpenAI vs Google Gemini competition. Why Marc Benioff switched from ChatGPT to Gemini and what that signals. GPT-6 timeline and the Q1 2026 capability jump. Consumer experience over model intelligence. Setting intentions vs micro-managing AI with prompts. Enterprise reasoning needs vs consumer speed demands. AI research intern capabilities coming in 2026. AGI roadmap through March 2028. ChatGPT evolving from chatbot to operating system.Key People Discussed:Sam Altman, CEO of OpenAI. Sundar Pichai, CEO of Google. Marc Benioff, CEO of Salesforce. Alex Kantrowitz, Big Technology.Companies Covered:OpenAI, Google, Anthropic, Meta, Salesforce, MicrosoftModels Referenced:GPT-5, GPT-6, ChatGPT, Gemini 3, Claude, LlamaThis Sam Altman AI 2026 analysis is essential listening for tech leaders, product managers, AI engineers, and investors navigating the most competitive moment in AI history.Source: Recap and key takeaways from Alex Kantrowitz's Big Technology interview with Sam Altman, December 2025. Hosted on Acast. See acast.com/privacy for more information.

Jan 4, 202620 min

S3 Ep 49The 5 Most Significant AI Developments of 2025

AI Developments 2025: 5 Breakthrough Moments That Changed EverythingAI developments 2025 transformed the technology landscape more dramatically than any year in history. From DeepSeek shocking Silicon Valley to OpenAI hitting $1 billion in monthly revenue, the pace of change was relentless. In this episode of The AI and Tech Society podcast, Dr. Marcus Webb breaks down the five most significant AI developments that reshaped how we work, build, and compete.Whether you follow OpenAI, Anthropic, Google, Meta, or emerging players like DeepSeek and xAI, this comprehensive review covers every major milestone that defined the year. Hosted on Acast. See acast.com/privacy for more information.

Dec 16, 202520 min

S3 Ep 48AI Product Evaluations for Product Managers

AI Evaluations Masterclass: How Product Managers and Tech Leaders at Top Companies Build Reliable AI SystemsAre you shipping AI features without knowing if they actually work? In this comprehensive episode of The AI and Tech Society, AI and tech leader Danar Mustafa delivers the definitive guide to AI evaluations—the systematic approach that separates production-ready AI from expensive failures.What You'll Learn:🔹 AI Evaluation Fundamentals – Understand what AI evals are, why LLM evaluation differs from traditional ML, and the five dimensions every team must measure: performance, robustness, fairness, factuality, and consistency.🔹 The 9-Step Evaluation Process – A field-tested framework covering everything from defining success metrics to continuous monitoring, used by engineering teams at leading tech companies like Anthropic, OpenAI, Google, Meta, and Microsoft.🔹 Complete Tools Comparison – Deep dive into the best AI evaluation frameworks:Promptfoo for prompt engineering and model comparisonRAGAS for RAG pipeline evaluationDeepEval for pytest-style LLM testingLangSmith and LangFuse for tracing and observabilityTruLens for inline feedbackArize Phoenix for LLM debuggingMLflow Evaluate for experiment trackingDeepchecks and EvidentlyAI for drift detectionRobustness Gym for adversarial testing🔹 CI/CD Integration – Copy-paste implementation plan for automating AI quality gates in your development pipeline, including specific thresholds for hallucination detection, accuracy regression, and safety violations.🔹 Real-World Patterns – Battle-tested evaluation setups for customer support AI, HR chatbots, RAG assistants, and content moderation systems deployed at scale.🔹 PM vs. Engineering Roles – Clear guidance on how product managers should lead evaluation strategy while engineers operationalize the technical infrastructure.Perfect For:Product Managers building AI-powered featuresMachine Learning Engineers deploying LLMs to productionEngineering Leaders establishing AI quality standardsTech Leaders at startups and enterprises adopting generative AIAnyone working with ChatGPT, Claude, Gemini, Llama, or other foundation modelsTools & Technologies Discussed: Promptfoo, RAGAS, DeepEval, LangSmith, LangFuse, TruLens, Arize Phoenix, MLflow, Deepchecks, EvidentlyAI, Robustness Gym, OpenAI Evals, LangChain, pytest, CI/CD pipelines, GitHub ActionsKeywords: AI evaluations, AI evals, LLM evaluation, machine learning testing, AI quality assurance, prompt engineering, RAG evaluation, hallucination detection, AI safety testing, MLOps, LLMOps, AI product management, generative AI deployment, foundation models, ChatGPT evaluation, Claude evaluation, AI metrics, model monitoring, AI observabilityWhether you're at a Fortune 500 enterprise, a high-growth startup, or a tech giant like Amazon, Google, Microsoft, Meta, or Apple, this episode provides the blueprint for shipping AI that users trust.Subscribe to The AI and Tech Society for weekly insights on artificial intelligence, machine learning, and technology leadership. Hosted on Acast. See acast.com/privacy for more information.

Dec 9, 202529 min

S3 Ep 47Key November 2025 Advances in AI and Technology

The AI and Tech Society Podcast - November 2025 EpisodeExplore the groundbreaking AI developments of November 2025: GPT-5.1, Gemini 3, and Claude Opus 4.5 model releases; massive investments including Microsoft's $5B Anthropic deal and AWS-OpenAI's $38B partnership; Cursor's $29.3B valuation; real-world AI deployments in healthcare and robotics; 54.6% US adult AI adoption rate; and what these advances mean for jobs, businesses, and the future of work. Tech leader Danar breaks down how AI crossed from emerging technology to mainstream infrastructure, with actionable insights for engineers, leaders, and professionals navigating this transformation.Keywords: AI developments 2025, GPT-5.1, Gemini 3, Claude Opus 4.5, Cursor AI valuation, AI investments, Microsoft Anthropic, AI adoption rates, future of work, AI transformation, tech industry trends, AI coding tools, robotics deployment, enterprise AI strategy Hosted on Acast. See acast.com/privacy for more information.

Dec 3, 202538 min

S3 Ep 46Estimating AI Productivity Gains from Claude Conversations

Today we have a truly fascinating episode for you. We're diving deep into one of the most important questions in technology right now: How much is AI actually boosting productivity? And not in some theoretical sense—we're talking about real-world data from millions of actual conversations from Claude AI. Hosted on Acast. See acast.com/privacy for more information.

Nov 27, 202523 min

S3 Ep 45Meta and Amazon Layoffs in the Age of AI: An Industry Shift

The past few years have seen unprecedented layoffs at tech giants Meta and Amazon, with software engineers and other tech workers among those affected. These cuts are not just cost-saving measures; they are intertwined with a strategic refocus on artificial intelligence (AI) and automation. Both companies have signaled a pivot toward AI-intensive initiatives, even as they trim roles in traditional software and business operations. This article analyzes the timeline and scope of the layoffs at Meta and Amazon, examines leadership statements about evolving AI strategy, and explores evidence of resources being redirected from conventional software projects to AI and machine learning. We’ll also discuss what this means for software engineers – whether it’s a story of job displacement or role transformation – and how it reflects broader industry trends of AI prioritization and automation. Hosted on Acast. See acast.com/privacy for more information.

Nov 20, 202533 min

S3 Ep 44State of AI 2025: McKinsey Report

The State of AI 2025 from McKinsey provides a reality check: AI adoption is nearly universal, yet real transformation remains concentrated among a small group of high-performing organizations. The hype is massive but the gap between excitement and enterprise-level impact is still wide.In this post, I want to break down the findings from the report through the lens of someone who builds and deploys AI systems, often for organizations trying to scale beyond pilots. These insights reflect both what the data shows and what I see daily in my conversations with executives, engineers, and AI strategy teams. Hosted on Acast. See acast.com/privacy for more information.

Nov 14, 202510 min

S3 Ep 43Cursor AI: The AI Code Editor Transforming Software Development

Cursor AI: The AI Code Editor Transforming Software DevelopmentCursor AI was created by a team of four MIT graduates – Michael Truell, Aman Sanger, Sualeh Asif, and Arvid Lunnemark – who founded the company (Anysphere, Inc.) in 2022. They launched the first version of Cursor in 2023 with the vision of an AI-native coding environment that could do much more than simple autocomplete. Early on, their prototype gained traction among developers for its ability to “understand, write, and debug code” alongside the user. This momentum helped the founders secure an $8 million seed round in 2023, led by OpenAI’s Startup Fund – a strong vote of confidence that gave Cursor access to capital (and cutting-edge AI models) to accelerate development.https://digitalstrategy-ai.com/2025/11/07/cursor-ai-business-model/ Hosted on Acast. See acast.com/privacy for more information.

Nov 9, 202527 min

S3 Ep 42Key October 2025 Advances in AI and Technology

We cover the high-stakes battle among Big Tech — OpenAI, Microsoft, Google, and NVIDIA — and the startup surge led by Poolside, Harvey, Mercor, and Fireworks AI. Plus a global workforce snapshot showing where AI anxiety is rising fastest. Hosted on Acast. See acast.com/privacy for more information.

Nov 4, 20257 min

S3 Ep 41ChatGPT Usage Trends 2025: 3 Surprising findings

As an AI and technology leader, I’ve witnessed firsthand how generative AI has transformed how people work, learn, and create. The newly released OpenAI study, “How People Use ChatGPT” (September 2025), offers the first large-scale, privacy-preserving analysis of how millions of people worldwide are actually using ChatGPT. The results reveal fascinating shifts in human–AI interaction — with implications far beyond productivity alone.This study analyzed over 18 billion weekly messages sent by 700 million users, representing nearly 10% of the global adult population. What it uncovers about ChatGPT usage trends in 2025 paints a vivid picture of AI adoption at scale — from the boardroom to the classroom, and from Silicon Valley to emerging markets.The Rise of Everyday AI Hosted on Acast. See acast.com/privacy for more information.

Oct 24, 20259 min

S3 Ep 40Inside DevDay 2025: What OpenAI Just Changed for Developers

Today we’re unpacking everything OpenAI announced at DevDay 2025 — and what it really means for developers and engineering teams Hosted on Acast. See acast.com/privacy for more information.

Oct 10, 202510 min

S3 Ep 39Key September 2025 Advances in AI and Technology

From OpenAI’s Sora 2 and Anthropic’s Claude Sonnet 4.5 to Alibaba’s trillion-parameter Qwen-3-Max, this episode unpacks the biggest breakthroughs shaping the AI landscape. We dive into policy shifts like the U.S. AI Risk Evaluation Act, DeepMind’s warnings, robotics and smart-city research, and the infographic revealing how people actually use ChatGPT. Join us for sharp insights on AI tools, startups, investments, and how Europe and Sweden are building their own sovereign AI future.Keywords: AI news 2025, ChatGPT trends, OpenAI Sora 2, Claude Sonnet 4.5, AI startups, EU AI Act, robotics, tech podcast, AI policy, machine learning, generative AI, AI tools, artificial intelligence updates. Hosted on Acast. See acast.com/privacy for more information.

Oct 6, 202512 min

S3 Ep 38A new OpenAI report on can AI do your job? The results are wild.

OpenAI’s GDPval study reveals how AI impacts jobs across industries. Learn what it means for workers, leaders, and the future of work.https://cdn.openai.com/pdf/d5eb7428-c4e9-4a33-bd86-86dd4bcf12ce/GDPval.pdf Hosted on Acast. See acast.com/privacy for more information.

Sep 28, 20256 min

S3 Ep 37AI Wins Coding Olympics 2025: Shocking Future of Programming

AI Wins Coding Olympics 2025: Shocking Future of ProgrammingICPC 2025: How AI Wins Coding Olympics Against the BestWhat It Means When AI Wins Coding Olympics CompetitionsThe Double-Edged Sword for Junior DevelopersHow Leaders Should RespondSkills That Matter in the Age Where AI Wins Coding OlympicsA Personal ReflectionConclusion: Humans Still Win by Working With AI Hosted on Acast. See acast.com/privacy for more information.

Sep 23, 20258 min

S3 Ep 36The Future Organization with AI Agents

When AI Becomes Part of the TeamAs AI evolves from passive tools to active agents, the fundamentals of how companies organize themselves are changing. AI agents can act independently, make decisions, and perform work tasks—qualities that make them more like digital colleagues than traditional software. This shift affects everything from corporate structure to leadership. Hosted on Acast. See acast.com/privacy for more information.

Sep 17, 20258 min

S3 Ep 35Key August 2025 Advances in AI and Technology

Key August 2025 AI Breakthroughs: GPT-5, Genie 3, and Global Adoption Trends Dive into the biggest AI news of August 2025 — OpenAI’s GPT-5 launch, DeepMind’s Genie 3 world model, record Big Tech investments, and the EU’s new AI Act. We also explore how AI is transforming healthcare, education, and professional workflows, plus fresh research connecting brain activity to LLMs. Hosted on Acast. See acast.com/privacy for more information.

Sep 9, 202512 min

S3 Ep 34The Evolution of AI Agents: A Technical Leader's Journey Through Six Critical Phases

From basic LLMs to autonomous enterprise systems - lessons learned building AI agents at scale Hosted on Acast. See acast.com/privacy for more information.

Sep 1, 202510 min

S3 Ep 33Understanding the AI Product Manager Role

Cracking the Code of the AI Product Manager RoleOver the past months, we have been diving deep into the evolving role of the AI Product Manager (AI PM)—through industry research, interviews, case studies, and real-world stories from practitioners at companies like Microsoft, Google, and Meta. What I’ve found is both exciting and messy: the AI PM is one of the most critical emerging roles in tech, but it’s also one of the hardest to define. Hosted on Acast. See acast.com/privacy for more information.

Aug 28, 20259 min

S3 Ep 3210 AI Trends That Will Shape the Next 6 Months

There is an incredible amount happening in AI right now, and the coming months will be crucial for how we integrate this technology into our daily lives. Here are my predictions on the biggest trends that will shape the near future. Hosted on Acast. See acast.com/privacy for more information.

Aug 26, 20256 min

S3 Ep 31Inside Google’s AI Playbook: What CTOs and Architects Need to Know

How Google Is Actually Using AI at WorkAI isn’t just theory at Google—it’s embedded in the way teams code, sell, market, and even eat lunch. The company recently shared 14 practical ways AI is making work faster, smarter, and more creative. Here’s the highlight reel every executive should know:Code at Scale: About 30% of new code is now generated with AI, helping engineers move faster while keeping quality high.Velocity Boost: AI support in testing, reviews, and migrations translates into a 10% bump in engineering speed.Smarter Bug Fixing: Duplicate issues are auto-resolved before they hit human queues, freeing teams to focus on the critical stuff.Creative Spark: Marketing and design teams use Gemini to brainstorm campaigns, test visuals, and even co-create event graphics and videos.Content Edge: YouTube and comms teams tap AI for catchy titles, timestamps, and summaries that improve reach.Sales Acceleration: AI has powered a 78% jump in RFP completions and a 14% improvement in lead conversions.Meeting Superpowers: AI-driven transcripts and summaries are now standard in Google Meet, used by millions every month.Workplace Insight: AI helps analyze employee surveys, summarize trends, and even streamline hiring by matching skills to roles.Sustainability Wins: In Google cafés, AI cut food waste per employee by nearly 40% compared to 2019. Hosted on Acast. See acast.com/privacy for more information.

Aug 22, 20257 min

S3 Ep 30NASA-Google AI Doctor - Revolutionary Space Medicine Transforming Earth Healthcare

Today we're examining one of the most ambitious medical AI projects ever undertaken: the NASA-Google Crew Medical Officer Digital Assistant, or CMO-DA. This revolutionary collaboration isn't just solving space medicine challenges - it's creating the blueprint for autonomous healthcare AI that could fundamentally transform medicine on Earth. With 88% diagnostic accuracy operating completely independently from Earth-based medical support, CMO-DA represents the most advanced autonomous medical AI system ever developed. We'll explore the cutting-edge technical architecture, examine the strategic business implications for the healthcare AI market, and discuss how this space-validated technology could revolutionize remote healthcare, emergency medicine, and medical access in underserved communities worldwide. Hosted on Acast. See acast.com/privacy for more information.

Aug 15, 202524 min

S3 Ep 29GPT-5 – OpenAI’s New AI Model Taking ChatGPT to the Next Level

GPT-5 is OpenAI’s latest and most advanced generative language model—the brain behind the new version of ChatGPT. It is described as a significant leap in intelligence compared to earlier models, with top performance in areas such as code, mathematics, writing, healthcare, and even visual understanding. Hosted on Acast. See acast.com/privacy for more information.

Aug 9, 202527 min

S3 Ep 28Lessons from OpenAI: Best Practices for Engineering Teams

Want to Learn from OpenAI’s Software Development Team? Here’s the Secret Sauce Behind Their SuccessCurious how OpenAI ships cutting-edge AI products at scale? Whether you’re an engineer looking to grow, a tech lead scaling your team, or exploring your next big career move — this post breaks down the real engineering practices behind OpenAI’s success.From monorepos and FastAPI to team autonomy and CI challenges, here’s what makes their engineering culture work — and what we can all learn (and apply) from it. Hosted on Acast. See acast.com/privacy for more information.

Aug 7, 202522 min

S3 Ep 27Key July 2025 Advances in AI and Technology

Trump US AI Plan, OpenAI Agents and record AI Investment from Big Tech! Hosted on Acast. See acast.com/privacy for more information.

Aug 3, 202522 min

S3 Ep 26US AI Strategy 2025: Key Insights for Business Leaders and Policymakers

US AI Strategy: Key Insights for Business Leaders and PolicymakersThe United States has established artificial intelligence as a critical strategic priority, with AI now permeating nearly every sector of the economy. As 2025 unfolds, American businesses and government agencies are accelerating efforts to harness AI’s benefits while policymakers navigate the complex challenge of maintaining the nation’s competitive edge amid intensifying global competition. Hosted on Acast. See acast.com/privacy for more information.

Jul 23, 202521 min