the bioinformatics lab
72 episodes — Page 1 of 2
EP 69: Honeybee Virology and Colony Collapse with Declan Schroeder

EP 68: Bioinformatics Careers, Communication, and Workflow Managers with Cristina Guijarro-Clarke
Summary In this podcast episode, the host and guest Cristina Guijarro-Clarke discuss various aspects of bioinformatics, including the importance of communication, navigating career paths, the role of social media, and the transition from academia to industry. We delve into the significance of workflow managers and the evolution of bioinformatics tools, emphasizing the need for both technical and soft skills in the field. The conversation highlights the collaborative nature of bioinformatics and the future trends shaping careers in this domain.

EP 67: Pathogen Genomics for Infection Prevention and Control with Ellie Carter
Ellie's background in microbiology and public health shaped her career in infection prevention. Building a laboratory network in Indiana was a significant project that enhanced public health. Infection preventionists play a crucial role in managing outbreaks in healthcare settings. Genomic sequencing is becoming increasingly important in identifying and controlling infections. Current outbreak detection methods in healthcare are often limited and retrospective. APIC 2024 highlighted the potential of genomic epidemiology in healthcare. Cost-effectiveness is a key factor in adopting genomic sequencing in hospitals. Challenges include financial investment and the need for bioinformatics expertise. Healthcare systems must learn from public health to implement genomic sequencing effectively. The ultimate goal is to prevent harm to patients through better infection control practices.

EP 66: One Health, Genomic Epidemiology, and Antimicrobial Resistance with Anita Kasanga
Summary In this conversation, Anita shares her experiences in critical patient care, emphasizing the importance of timely clinical alerts and collaboration among healthcare professionals. She recounts a specific case involving a 12-year-old girl post-surgery, highlighting the urgency of addressing antibiotic resistance and the need for effective communication within the medical team.

EP 65: Taking Time Off with Christie Rose Woodside
In this episode of the Bioinformatics Lab Podcast, Christie Rose Woodside shares her journey from academia to industry, her experiences during the COVID-19 pandemic, and her decision to walk El Camino with her father. The conversation explores the importance of taking time off for personal growth, the impact of experiences on career choices, and the value of stepping back to gain perspective on life and work.

EP 64: Machine Learning and Mutli-Omics Data with Mxolisi Nene
In this episode of the Bioinformatics Lab Podcast, Mxolisi Nene shares his journey from a curious kid “scanning soil” with a stick and a broken Pentium II in rural KwaZulu-Natal to a bioinformatician and PhD candidate at the Agricultural Research Council in Pretoria. He walks through his path from animal science into bioinformatics, profiling the gut microbiomes of indigenous village chickens using 16S and metagenomic sequencing, and how wrestling with messy real-world data led him into multi-omics integration and machine learning. Mxolisi explains concepts like feature engineering, neural networks, and ecological “tipping points” in soil ecosystems—showing how combining metagenomic, metabolomic, proteomic, and genomic layers can help predict when an environment is on the brink of collapse, with implications for agriculture, food security, and even disease research. We also dig into the philosophical side of his work: why the explosion of public omics data makes it almost a moral obligation to use these tools for better outbreak prevention and environmental stewardship, how conferences like PHA4GE in Cape Town and the AI working group are quietly seeding a new generation of multi-omics scientists, and what it feels like to realize that the five-year-old kid obsessed with dirt grew up to do exactly what he was pretending to do—only now with HPC clusters, neural nets, and GitHub.

EP 63: Mycotic Bioinformatics with Zachary Konkel
In this episode of the Bioinformatics Lab Podcast, Zachary Konkel shares his journey from studying chemistry to specializing in mycology and bioinformatics. He discusses the significance of fungi in medicine, particularly in producing antimicrobial compounds, and the challenges faced in fungal genomics. Zach highlights the development of mycotools (https://github.com/xonq/mycotools) as a solution for analyzing fungal genomes and the importance of phylogenetics in understanding fungal pathogens like Candida auris. The conversation also touches on the growing concern of antifungal resistance and the potential for next-generation sequencing to improve outbreak detection. Finally, Zach shares his passion for mushroom foraging, connecting his scientific interests with personal hobbies.

EP 62: Public Health Pathogen Genomics in Africa with James Otieno
Summary In this episode of the Bioinformatics Lab podcast, James Oteno shares his journey as a bioinformatician and genomic epidemiologist, discussing the intersection of bioinformatics and public health, particularly in the context of emerging pathogens in Africa. The conversation covers the importance of both wet lab and dry lab skills, the role of KEMRI in global health, and the impact of COVID-19 on public health practices. Oteno also delves into the MPOX outbreak, the rapid sequencing technology used in Ebola outbreaks, and the future of genomic epidemiology in Africa, emphasizing the need for collaborative networks and equitable partnerships in global health efforts.

EP 61: MiDog Technical Deep Dive with Kaylie Zapanta
Summary In this engaging conversation, Kaylie Zapanta, a PhD in neurophysiology, discusses her journey from academia to her current role at MyDog, where she focuses on veterinary diagnostics through NGS data. The discussion highlights the complexities of microbial interactions, the importance of effective communication of scientific data to clinicians, and the future of diagnostics in veterinary medicine, emphasizing the shift towards sequencing technologies over traditional methods.

EP 60: NGS for Veterinary Diagnostics with Janina Krumbeck (MiDog)
In this episode of the Bioinformatics Lab Podcast, Janina Krumbeck shares her unique journey from Germany to the United States, detailing her academic experiences and the challenges she faced during her PhD program. She discusses the importance of mentorship, the transition from academia to industry, and her current role in the microbiomics team at Zymo Research. Janina emphasizes the value of persistence and finding a fulfilling career path in the biotech industry.

EP 59: Navigating Bioinformatics and Public Health with Amy Gaskin
In this conversation, Amy Gaskin shares her journey into bioinformatics, detailing her transition from an ecological background to a career in public health. She discusses the pivotal moments that led her to discover her passion for bioinformatics, the essential skills she acquired during her master's program, and her experiences working in pathogen genomics during the COVID-19 pandemic. Amy emphasizes the importance of workflow management, containerization, and standardization in bioinformatics, particularly in the context of public health. She also reflects on her current work modernizing HIV pipelines and the collaborative efforts within the field.

EP 58: From Veterinary Medicine to Pathogen Genomics with Gültekin Ünal
In this episode of the Bioinformatics Lab podcast, Gültekin Ünal shares his unique journey from veterinary medicine to bioinformatics, highlighting the pivotal moments that shaped his career. For more information about the PHA4GE organization discussed in this episode, please visit: https://pha4ge.org/ He discusses the importance of microbiology in understanding diseases, the impact of COVID-19 on public health practices, and the significance of the One Health approach in integrating animal and human health. Gültekin emphasizes the role of bioinformatics in pathogen surveillance and the need for collaboration within the scientific community to address global health challenges.

EP 57: Open Conversation with Tommy Tang
Tommy's Blog: https://divingintogeneticsandgenomics.com/ Summary In this conversation, Kevin Libuit and Tommy Tang explore the journey of communication in science, particularly in the field of bioinformatics. They discuss the importance of sharing knowledge, the transition from wet lab to computational biology, and the evolution of bioinformatics tools. Tommy shares his experiences with learning Unix, the balance between remote work and in-person interactions, and the mindset of finding opportunities in crises. The conversation highlights the significance of humility, continuous learning, and the interconnectedness of roles within the scientific community. Takeaways Tommy started his blog to document his learning journey in bioinformatics. Criticism can be a positive opportunity for growth and collaboration. Journaling experiences helps in personal growth and reflection. Unix skills are foundational for bioinformatics and computational biology. The transition from wet lab to computational biology can be challenging but rewarding. Understanding the data generation process is crucial for analysis. In-person interactions enhance communication and collaboration. The biotech startup environment fosters agility and innovation. Mindset shifts can turn crises into opportunities. The distinction between analysts and developers is becoming more pronounced in the field.

EP 56: Open Conversation with Keith Robison
Ketih's Blog, Omics Omics: https://omicsomics.blogspot.com/ Summary In this conversation, Keith Robison discusses the inception of his blog Omix, the evolution of social media in the scientific community, and the importance of feedback and community engagement in scientific writing. He delves into the changing landscape of sequencing technologies, highlighting the emergence of new companies and the implications for biodiversity research. The discussion also touches on the future of sequencing, the challenges of cost reduction, and the potential for sequencing technologies to explore previously uncharted biological phenomena.

EP 55: Microbiome and Human Health with Momo Vuyisich
Summary In this conversation, Dr. Momo Vuyisich discusses the significance of metatranscriptomics in understanding the microbiome and its impact on human health. He emphasizes the importance of gene expression over gene content, the challenges of metabolomics, and the innovations in RNA technology that have made metatranscriptomics more accessible and cost-effective. Dr. Vujicic also shares insights on building a successful biotech company, the future of diagnostics, and how to define a healthy microbiome based on biochemical reactions. In this conversation, Momo and Kevin delve into the complexities of gut health, particularly focusing on hydrogen sulfide production and its implications for overall health. They discuss the intricacies of personalized nutrition, the limitations of traditional health metrics, and the need for a paradigm shift in nutritional science towards preventive maintenance. The conversation emphasizes the importance of understanding individual microbiome profiles to create tailored dietary recommendations, moving away from one-size-fits-all approaches. More info at https://www.viome.com/

EP 54: Pathogen Genomics in Healthcare w/ Alex Sundermann
Pathogen genomics in healthcare: overcoming barriers to proactive surveillance: https://journals.asm.org/doi/10.1128/aac.01479-24 Summary In this conversation, Kevin Libuit and Alex Sundermann delve into the significance of pathogen genomics in healthcare, discussing its role in infection prevention and control. They explore the transition from traditional methods to genomic surveillance, the challenges of operationalizing these technologies, and the ethical implications of transparency in patient care. The discussion also highlights the need for evidence-based practices, the importance of incentives for hospitals, and the potential for lowering barriers to entry for genomic technologies in healthcare settings. In this conversation, Alex Sundermann and Kevin Libuit discuss the critical role of genomic surveillance in infection prevention and control. They emphasize the need for actionable insights from genomic data, the importance of metadata, and the challenges of data sharing across institutions. Financial sustainability and legal implications of genomic practices are explored, alongside the necessity for community standards and equitable access to genomic technologies. The conversation concludes with a call for learning from outbreaks and the importance of publishing findings to improve patient safety. Takeaways Pathogen genomics enhances understanding of disease transmission in healthcare. Genomic surveillance can significantly improve infection prevention efforts. Public health has successfully utilized pathogen genomics for outbreak detection. The evidence supporting genomic surveillance is compelling but underutilized in healthcare. Operationalizing genomic surveillance requires collaboration between hospitals and payers. Ethical considerations are crucial in the implementation of genomic technologies. Incentives and regulations are needed to encourage hospitals to adopt genomic surveillance. Lowering the barriers to entry can facilitate the adoption of genomic technologies. Commercial partnerships can help hospitals access genomic sequencing services. Methodological standards in genomics are essential for effective interpretation of results. Genomic data must be actionable for infection preventionists. Metadata is crucial for effective genomic surveillance. Data sharing across institutions is a significant challenge. Financial sustainability is essential for genomic surveillance programs. Legal liability may increase for hospitals not using genomic surveillance. Community standards for genomic practices are needed. Equitable access to genomic technologies is vital. Learning from outbreaks can improve patient safety. Publishing outbreak findings is essential for knowledge sharing. Genomic surveillance can help identify misallocated resources.

EP 53: Remote Work
Summary In this episode of the Bioinformatics Lab podcast, Kevin Libuit and Andrew Page discuss the evolution of remote bioinformatics work, exploring the benefits and challenges of remote work, the importance of in-person interactions, and the ongoing debate surrounding the return to office policies. They share personal experiences and insights on how remote work has transformed their professional lives, the impact of commuting on quality of life, and the necessity of fostering communication and connection in a virtual environment. Takeaways Remote work allows hiring the best talent globally. Quality of life improves with reduced commuting time. In-person interactions enhance team dynamics and relationships. Informal conversations are crucial for problem-solving. Virtual communication lacks the nuances of face-to-face interactions. Creating informal communication spaces is essential in remote work. Return to office policies are a complex issue. Isolation can negatively impact mental health in remote work. Intentionality is key in fostering workplace culture remotely. Balancing remote work with in-person meetings is vital.

EP 51: Software Bugs
Summary In this episode, Kevin Libuit and Andrew Page discuss the complexities of software bugs, focusing on the distinction between user errors and software errors. They explore the importance of understanding user needs and the challenges of software compatibility, particularly in the context of bioinformatics. The conversation emphasizes the need for a multidisciplinary approach to software development and troubleshooting. Takeaways A lot of software lacks proper engineering and testing. User errors often stem from simple mistakes like formatting. Debugging requires careful attention to detail. New line endings can cause compatibility issues across systems. User documentation is crucial for effective troubleshooting. Understanding user needs can prevent software errors. Software engineers sometimes create unnecessary complexities. Output formats should be user-friendly and compatible. A multidisciplinary approach enhances software development. Effective communication between users and developers is essential.

EP 52: Genomic Epidemiology with Krisandra Allen
Summary In this episode of the Bioinformatics Lab podcast, Kevin Libuit and Krisandra Allen discuss the integration of bioinformatics and epidemiology, focusing on the challenges and solutions in utilizing genomic sequencing data for infectious disease surveillance. They explore the roles of epidemiologists and bioinformaticians, the importance of communication and collaboration, and the need for bi-directional learning to enhance public health practices. The conversation emphasizes the significance of building relationships and understanding each other's fields to effectively turn sequencing data into actionable insights for disease control. Takeaways Epidemiologists work on the front lines of public health. Genomic data can help build and confirm hypotheses in investigations. Integrating genetic data with epidemiological data enhances disease surveillance. Surveillance systems must be designed to drive public health action. Communication between labs and epidemiologists is crucial for effective data use. Bi-directional learning is essential for both epidemiologists and bioinformaticians. Genomic epidemiologists can bridge the gap between bioinformatics and epidemiology. Building relationships is the first step towards effective collaboration. Training opportunities should be available for both sides to learn from each other. Collaboration is key to successfully integrating bioinformatics into public health.

EP 50: Software Naming
Summary In this episode of the Bioinformatics Lab podcast, Kevin Libuit and Andrew Page discuss the art and challenges of naming bioinformatics tools. They explore various naming conventions, the significance of unique and memorable names, and the creative strategies employed by developers. The conversation also touches on the importance of trademark considerations and the potential pitfalls of naming conflicts in the bioinformatics community. Takeaways Naming bioinformatics tools is crucial for usability. A good name should be unique and memorable. Backronyms are a common naming strategy. Consistency in naming is important for software development. Names can be inspired by personal experiences or local culture. Trademark issues can arise with popular names. Creative themes can enhance the identity of tools. Names should be easy to Google and not offensive. Developers should consider the implications of their chosen names. A well-thought-out name can aid in communication and collaboration.

EP 49: H5N1 Outbreak
Summary In this episode of the Bioinformatics Lab Podcast, Kevin Libuit and Andrew Page discuss the ongoing H5N1 outbreak, its implications for public health, and the economic impact on agriculture. They explore the nature of avian influenza, the risks of mammalian spillover, and the importance of genomic surveillance in understanding and combating the virus. The conversation highlights the rapid advancements in vaccine development and the need for ongoing monitoring to prevent future outbreaks. Takeaways H5N1 is a highly pathogenic avian influenza affecting poultry and cattle. Influenza viruses are adept at recombining, posing pandemic risks. The current H5N1 outbreak has significant economic impacts on U.S. agriculture. Mammalian spillover from birds to cattle raises public health concerns. Surveillance data is crucial for understanding the spread of H5N1. Genomic sequencing aids in vaccine development and monitoring mutations. Pasteurization is vital in preventing the spread of H5N1 through dairy products. The economic impact of H5N1 in the U.S. is already over a billion dollars. Public health risks from H5N1 remain low but could increase with mutations. Collaboration between public health labs and tech communities is essential for effective response.

EP 47: The Job Hunt Experience pt. 02
Summary In this episode of the Buy Informatics Lab podcast, Kevin Libuit and Andrew Page discuss the intricacies of the job market from a recruiter's perspective. They share insights on how to effectively navigate the recruitment process, the importance of leveraging professional networks, and the role of online platforms like LinkedIn and GitHub in identifying and attracting talent. The conversation emphasizes the significance of having a strong digital footprint and the strategies that can help candidates stand out in a competitive job market. Takeaways Successful hires often come from personal networks. Recommendations from trusted colleagues add value. Passion for bioinformatics is crucial in candidates. Being findable on LinkedIn is essential for job seekers. A strong digital presence reduces friction in recruitment. Effective communication skills are vital for collaboration. Separating personal and professional online identities is advisable. Engagement in technical communities can highlight talent. GitHub profiles serve as portfolios for developers. A clear digital footprint enhances job application success.

EP 48: The Kalamari Database
Kalamari GitHub: https://github.com/lskatz/Kalamari & Public Health Announcement: https://journals.asm.org/doi/10.1128/mra.00963-24 Summary In this episode of the Bioinformatics Lab podcast, Kevin Libuit and Andrew Page discuss a public health announcement regarding the Kalamari database, a curated reference database for public health pathogenomics. They explore the importance of high-quality genomic data, the implications of mobile genetic elements, and the various applications of the database in public health. The conversation highlights the collaborative efforts of experts in the field and the significance of reliable data in making informed public health decisions. Takeaways The Kalamari database is a curated resource for public health. High-quality genomic data is essential for accurate pathogen identification. Public health decisions rely on the accuracy of genomic data. Mobile genetic elements complicate species typing in bioinformatics. The collaboration of experts enhances the quality of the database. The database allows for effective quality control in laboratories. Understanding plasmids is crucial for outbreak investigations. The CDC's involvement underscores the importance of public health infrastructure. The database is accessible and user-friendly for public health applications. This initiative represents a significant advancement in pathogen genomics.

Ep. 46: The Job Hunt Experience pt. 01
Summary In this episode, Kevin Libuit and Andrew Page discuss their personal experiences with job hunting in the bioinformatics field, emphasizing the importance of professional networks and genuine relationships. They share anecdotes about how their careers were shaped by connections made through academia and social interactions. The conversation highlights actionable advice for job seekers, including leveraging social media and engaging with the community to build a robust professional network. Takeaways All jobs can stem from knowing the right people. Building a professional network is essential for career growth. Genuine curiosity and kindness can lead to unexpected opportunities. Networking should be organic, not forced or transactional. Social media platforms are valuable tools for connecting with others. Engaging in community discussions can enhance visibility and opportunities. Conferences provide a great avenue for networking and learning. It's important to have a clear online presence, like LinkedIn. Networking is about mutual interest, not just self-promotion. Being helpful and interested in others can pay off in the long run.

Ep 45: Pathogen Focus - MPXV
Summary In this episode of the Bioinformatics Lab Podcast, Kevin Libuit and Andrew Page discuss the complexities of analyzing MPXV genomes, particularly in light of recent outbreaks. They explore the differences between MPXV and other viruses, the challenges faced in genomic sequencing and analysis, and the importance of public health responses, including vaccination efforts. The conversation highlights the need for updated tools and resources to effectively monitor and respond to MPXV outbreaks globally. Takeaways Viruses require different types of treatment and analysis. MPXV is a larger virus compared to SARS-CoV-2. Amplicon dropout is a significant concern in sequencing. Public health responses can effectively contain outbreaks. Genomic surveillance is crucial for monitoring MPXV. Different variants of MPXV require updated reference genomes. Vaccination efforts can limit the spread of M-Pox. The availability of hosts affects the outbreak dynamics. Bioinformatics tools need to be adaptable for different viruses. Collaboration among international labs enhances genomic analysis.

Ep 44: Career Focus:Julian Paganini & Olinto Linares-Perdomo
Note: This episode was recorded in Spanish. A video of this recording with AI-generated, English subtitles is available on YouTube: https://www.youtube.com/watch?v=FNjUbtcm9Z4 Summary In this episode of the Bioinformatics Lab Podcast, Kevin Libuit hosts Julian and Olinto in a special Spanish-language episode. They discuss their journeys in bioinformatics, the importance of collaboration in science, and their experiences at a hackathon. The conversation highlights the barriers of language in accessing scientific resources and the significance of community in learning and sharing knowledge. Takeaways The podcast aims to provide resources in Spanish for the scientific community. Julian's journey began in Argentina, focusing on bioinformatics and data fusion. Olento's background includes mathematical modeling and pharmacokinetics. Collaboration is key in scientific research and development. Hackathons provide excellent opportunities for networking and learning. Cultural exchange enriches the scientific community. Free online resources like Coursera are valuable for learning. The importance of community in overcoming learning barriers. Experiences at hackathons can lead to significant collaborations. Creating resources in Spanish is essential for inclusivity in science.

Ep 43: Pathogen Focus: Dengue Virus
Summary In this episode of the Bioinformatics Lab Podcast, Kevin Libuit and Andrew Page discuss the challenges and limitations in analyzing dengue virus genomes using bioinformatics. They highlight the importance of open-source tools, the need for better data sharing, and the complexities of virus typing compared to bacteria. The conversation emphasizes the neglected status of dengue as a pathogen and the urgent need for improved methodologies in public health. Takeaways Dengue is a small but significant virus affecting warmer climates. Current mathematical tools for dengue analysis are inadequate. Open-source tools are essential for sustainable dengue typing. Phylogenetic placement is crucial for understanding circulating strains. Different countries have unique circulating dengue types. Amplicon sequencing is a viable method for dengue analysis. Data sharing in the viral world is significantly limited. The viral taxonomy system is less organized than that of bacteria. There is a pressing need for better tools and methods for dengue research. The conversation around dengue needs to be expanded and prioritized.

Ep 42: GMI 14, Barcelona, Spain
Summary In this episode of the Bioinformatics Lab podcast, Kevin Libuit and Andrew Page discuss their experiences at the GMI 14 conference in Barcelona, reflecting on the importance of metadata in microbial bioinformatics, the role of AI in data sharing, and the significance of communicating the 'why' behind bioinformatics initiatives. They explore differing opinions on metadata's importance, the implications of AI technology, and the need for sustainable practices in the field. Takeaways GMI 14 maintained the spirit of GMI 13 with a focused cohort. The importance of metadata in tracking outbreaks was debated. AI's role in microbial bioinformatics is growing. Rich metadata enhances genomic epidemiology applications. Data sharing can lower the burden of collaboration. The 'why' behind bioinformatics initiatives is crucial for sustainability. Case studies can illustrate the impact of bioinformatics. Melbourne is set to host GMI 15 next year. Engaging with policymakers is essential for funding. The conference fosters connections among key players in the field.

Ep 41 Importance of Dev Environments
Summary In this episode, Kevin and Andrew discuss the importance of dev environments in bioinformatics and the adoption of software engineering practices. They highlight the need for separate environments for development, production, and staging, and the importance of stability and consistency in production code. They also emphasize the role of testing and automated processes in ensuring code quality. The conversation touches on the maturation of the field and the need for standardized practices in bioinformatics. They conclude by discussing the impact of bioinformatics on real-life decisions and the importance of professionalism in the field. Takeaways Dev environments are separate environments for development, production, and staging, and are essential for mitigating risks and ensuring code stability. Testing and automated processes play a crucial role in maintaining code quality and confidence in production code. The field of bioinformatics is maturing, and the adoption of software engineering practices and standardized practices is necessary for the widespread use of bioinformatics technologies. Bioinformatics has real-life impacts and requires professionalism to ensure the accuracy and reliability of the data. In the applied space, curiosity and drive for continuous improvement should be focused on solving the problems faced by the user base.

Ep 40 Career Spotlight: Sharvari Narendra
Summary In this conversation, Sharvari Narendra shares her career journey in bioinformatics, which includes experiences in academia, industry, and a startup. She emphasizes the importance of exploring different career tracks and finding alignment with personal passions. Sharvari highlights the creativity and problem-solving skills required in bioinformatics, as well as the need to understand the tools and pipelines used in data analysis. She also discusses the differences between working in academia and industry, including flexibility and the level of collaboration. Sharvari encourages early career professionals to attend conferences, network, and continue learning in order to advance in the field.

Ep 39: Bioinformatics Hackathons
ASM NGS Hackathon Details: - Date: 10-12 October 2024 - Location: Washington DC, Venue TBD (within 30 minutes of the ASM NGS venue) - Registration form: https://lnkd.in/e8KTqZYQ - Admission: Free (Sponsored by Theiagen Genomics) Summary In this episode, Kevin and Andrew discuss hackathons and the upcoming ASM NGS Hackathon. They explain that hackathons bring together people from different backgrounds to work on community problems and solve common challenges. The hackathon at ASM NGS will be a virtual and in-person event, allowing for broader participation. They emphasize the value of community building and networking that comes from hackathons, as well as the opportunity to collaborate and work on specific projects. They encourage people to bring their curiosity and willingness to collaborate to hackathons, regardless of their technical background. Takeaways Hackathons bring together people from different backgrounds to work on community problems and solve common challenges. The ASM NGS Hackathon will be a virtual and in-person event, allowing for broader participation. Hackathons provide opportunities for community building, networking, and collaboration. Participants should bring their curiosity and willingness to collaborate, regardless of their technical background.

Ep 38: Workflow Infrastructure
Also available on YouTube: https://youtu.be/i9oT5ku11is Summary In this episode, Kevin and Andrew discuss different mechanisms to run and support bioinformatics pipelines. They explore the use of workflows, workflow managers, and graphic user interfaces for pipeline execution. They highlight popular workflow options such as Nextflow, Terra, and Galaxy, and discuss the benefits of using standardized workflow languages. They also emphasize the importance of scalability and the drawbacks of creating in-house pipeline systems. The conversation concludes with a discussion on the differences in data models and administrative overhead among the different workflow options. Takeaways Understanding workflows is a fundamental skill in bioinformatics. Popular workflow options include Nextflow, Terra, and Galaxy. Standardized workflow languages help standardize development and execution. Using out-of-the-box solutions reduces the need for creating in-house pipeline systems. Consider the data model and administrative overhead when choosing a workflow option.

Ep 37: SRALite
YouTube: https://youtu.be/RybtspKHkgA Summary The conversation discusses the topic of SRA Lite, a data storage format for genomic information. SRA Lite is a compressed version of the original data, with the quality scores removed. The conversation highlights the challenges and controversies surrounding SRA Lite, including the loss of data granularity and the confusion around accessing the original files. The lack of clear communication from NCBI is also mentioned as a problem. The conversation suggests that the normalized files should be the default format for user access and retrieval. Overall, better communication and understanding of SRA Lite are needed. Takeaways SRA Lite is a compressed version of genomic data that removes quality scores. The use of SRA Lite has sparked controversy due to the loss of data granularity. There is confusion around accessing the original files and distinguishing between SRA Lite and full FASTQ files. Better communication and understanding of SRA Lite are needed to address user frustrations.

Ep 36: Technical Career Tracks
Also available on YouTube: https://youtu.be/ZlKYKuOeGZc Summary In this episode, Kevin Libuit and Andrew Page discuss the different levels of a bioinformatics developer and the career track in the field. They explain that bioinformatics is a relatively new field and the levels have emerged over time. The entry-level position is called a junior or associate bioinformatics scientist, where individuals follow established protocols and apply existing tools. The senior level involves more independent work and ownership of bioinformatics solutions. The principal level focuses on strategic decision-making and technical leadership. The hosts emphasize the importance of finding happiness and fulfillment at any level in the field. Takeaways Bioinformatics is a relatively new field, and the levels of a bioinformatics developer have emerged over time. The entry-level position involves following established protocols and applying existing tools. The senior level requires more independent work and ownership of bioinformatics solutions. The principal level focuses on strategic decision-making and technical leadership. It is important to find happiness and fulfillment at any level in the field.

Ep 35: Bioinformatics in the Cloud
Summary In this episode, Kevin and Andrew discuss cloud computing in the context of bioinformatics. They explore the different layers of cloud technology and how it can be implemented in laboratories. They highlight the benefits of cloud computing, such as quick access to resources, cost savings, and scalability. They also discuss the challenges of managing costs and ensuring security in a cloud environment. Overall, they emphasize the importance of understanding the specific needs of bioinformatics and choosing the right cloud infrastructure. Takeaways Cloud computing offers quick access to resources and cost savings compared to on-premises infrastructure. The specific needs of bioinformatics, such as high volume and different hardware requirements, should be considered when setting up a cloud infrastructure. Managing costs and ensuring security are important challenges in a cloud environment. Cloud computing provides scalability and flexibility to meet the evolving needs of bioinformatics.

Ep 34: PHA4GE with Dr. Alan Christoffels
PHA4GE Website: https://pha4ge.org/ Summary Alan Christoffels, the principal investigator for the Public Health Alliance for Genomic Epidemiology (PHA4GE), discusses the challenges and future directions of the consortium. He shares his background as a scientist and director of a program leading international collaborations in pathogen genomics. The conversation highlights the importance of global community building and the need to connect laboratories across borders to address common challenges in public health. The impact of PHA4GE in the response to the COVID-19 pandemic is also discussed, along with the value of creating a community of practice and providing opportunities for professionals to contribute. Alan invites individuals from various backgrounds to get involved in PHA4GE and contribute to its mission. Takeaways PHA4GE is a global consortium that aims to connect laboratories across borders to address common challenges in public health. Alan Christoffels emphasizes the importance of global community building and the need to involve individuals from various backgrounds in PHA4GE. The impact of PHA4GE in the response to the COVID-19 pandemic is discussed, highlighting the value of creating a community of practice and providing opportunities for professionals to contribute. Alan encourages individuals to get involved in PHA4GE and contribute to its mission.

Ep 33: Communicating Results in Public Health
Summary In this episode, Kevin Libuit and Andrew Page discuss the challenges of communicating complex genomic information to different audiences in public health. They explore the need for tailored communication strategies for various stakeholders, including bioinformatics scientists, epidemiologists, and the general public. The power of analogies in simplifying complex concepts is highlighted, along with the importance of building bridges between different technical fields. The conversation also emphasizes the need to communicate the limits and nuances of genomic data, and the role of genetic relatedness as a proxy for epidemiological associations. Overall, the episode underscores the ongoing need for effective communication in the field of genomics and public health. Takeaways Tailored communication strategies are necessary when communicating complex genomic information to different audiences in public health. Analogies can be powerful tools for simplifying complex concepts and making them relatable to a wide range of stakeholders. Building bridges between different technical fields, such as bioinformatics and epidemiology, is crucial for effective communication. Communicating the limits and nuances of genomic data is essential to avoid misinterpretation and ensure proper understanding.

Ep 32: Best Practices - Pipeline Development, Part Three
Summary This episode of the Bioinformatics Lab Podcast continues the conversation on public health pipeline best practices. The focus is on pipeline functionality, documentation for local install and remote access, and example usage. The hosts discuss the importance of clearly articulating the function of a pipeline and the different pipeline systems available. They emphasize the need for documentation that includes instructions for installation and usage, as well as providing example data. The episode concludes with a call to read the best practices document and the announcement of future efforts to promote and assess adherence to these practices. Takeaways Clearly articulate the function of a pipeline in the field of public health bioinformatics. Use pipeline systems like Galaxy and Terra to visually represent and manage workflows. Provide clear documentation for local installation and remote access of pipelines. Include example usage and data to facilitate understanding and testing of pipelines.

Ep 32: Best Practices - Pipeline Development, Part Two
PHA4GE Ten Best Practices for Public Health Bioinformatics Pipelines: https://github.com/pha4ge/public-health-pipeline-best-practices/blob/main/docs/pipeline-best-practices.md Summary In this episode, Kevin Libuit and Andrew Page discuss the 10 best practices for public health pipeline development. They start by emphasizing the use of common file formats and the importance of avoiding reinventing the wheel. They highlight the benefits of standard file formats and the availability of parsers for different languages. They also discuss the implementation of software testing, including the use of automated testing and the integration of testing with Docker containers. They emphasize the need for accessibility to benchmark or validation data sets and the importance of reference data requirements. They also touch on the significance of hiring bioinformaticians and the documentation practices that should be followed. Takeaways Use common file formats to avoid reinventing the wheel and enable compatibility with other programs. Implement software testing, including automated testing, to ensure functionality and identify bugs. Provide benchmark or validation data sets to allow users to compare and evaluate the performance of the pipeline. Consider the reference data requirements and ensure accessibility to curated databases. Hire bioinformaticians with domain expertise to navigate the complexities of pipeline development. Follow documentation practices, including communication of authorship, pipeline maintenance statements, and community guidelines for contribution and support.

Ep 32: Best Practices - Pipeline Development, Part One
PHA4GE Ten Best Practices for Public Health Bioinformatics Pipelines: https://github.com/pha4ge/public-health-pipeline-best-practices/blob/main/docs/pipeline-best-practices.md Summary In this episode, Andrew Page and Kevin Libuit discuss best practices for public health bioinformatics pipelines. They highlight the importance of code availability, open source licensing, version control, workflow management systems, and containerized and packaged software. These practices aim to improve transparency, reproducibility, and interoperability in the field of bioinformatics. Takeaways Code availability and open source licensing are crucial for transparency and collaboration in public health bioinformatics. Version control allows for the tracking of software changes and facilitates collaboration. Workflow management systems provide standardization and interoperability in pipeline development. Containerized and packaged software ensures reproducibility and simplifies software installation.

Ep31: AI-Generated Images
Summary In this episode, Kevin Libuit and Andrew Page discuss the use of AI-generated images in scientific publications. They start by addressing a recent retracted publication that featured images with anatomical inaccuracies and AI-generated labels. They highlight the failure of the peer review process and the responsibility of reviewers, editors, and authors in detecting such issues. The conversation expands to the broader problem of data manipulation and the potential use of AI in scientific writing. They also discuss the need for quality over quantity in publishing and the future of peer review. Takeaways The use of AI-generated images in scientific publications can lead to inaccuracies and fraudulent representations. The peer review process failed to detect the issues in the retracted publication, highlighting the need for improved review practices. AI has the potential to assist in scientific writing, but caution must be exercised to ensure accuracy and transparency. The publishing industry needs to prioritize quality over quantity and address the problem of low-quality journals accepting subpar papers.

Ep30 Part 2: Gut-Brain Axis with Kevin Bonham, Part Two
Manuscript Discussed: https://www.science.org/doi/10.1126/sciadv.adi0497 Summary This conversation discusses a study that explores the relationship between the gut microbiome and cognitive function. The study design involved collecting stool samples and cognitive function scores from a diverse population of children. The researchers used various statistical methods, including linear models and machine learning, to analyze the data. The results showed associations between specific microbial profiles and cognitive function scores, as well as correlations with brain regions. The study suggests the potential for using the gut microbiome as a screening tool and for future interventions to improve cognitive development. Takeaways The gut microbiome may play a role in cognitive development and function. Specific microbial profiles are associated with cognitive function scores. Machine learning techniques can help identify important microbial features. The gut microbiome may have implications for brain regions and cognitive performance.

Ep30 Part One: Gut-Brain Axis with Kevin Bonham
Manuscript Discussed: https://www.science.org/doi/10.1126/sciadv.adi0497 Summary In this episode, Dr. Kevin Bonham discusses the relationship between the gut microbiome and brain development. He explains the concept of the gut-brain axis and provides evidence for the connection between the two. Dr. Bonham also highlights the mutability of the gut microbiome and its potential impact on cognitive outcomes. The main objective of the study is to explore the association between gut microbiome profiles and cognitive capabilities in healthy individuals. Takeaways The gut microbiome and brain development are connected through the gut-brain axis. The gut microbiome plays a role in cognitive outcomes, including neurodevelopment and mental health. The gut microbiome is mutable and can be influenced by factors such as diet and antibiotics. Understanding the relationship between the gut microbiome and brain development can lead to potential interventions for cognitive impairments.

Ep29 Career Spotlight: Kevin G. Libuit
In this episode, Kevin Libuit discusses his career in the world of public health, pathogen genomics, and bioinformatics. He shares his journey from studying biology with the intention of pursuing a medical career to discovering his passion for bioinformatics during his undergraduate research. Kevin explains how he transitioned from medicine to bioinformatics and joined the Virginia State Lab, where he became involved in national initiatives and the creation of the State Public Health Bioinformatics Working Group (StaPH-B). He also talks about his time at Theiagen and the ever-evolving nature of bioinformatics careers. Takeaways Bioinformatics careers often involve unconventional paths and opportunities that arise along the way. Transitioning from one field to another requires making difficult decisions and following your passion. Collaboration and communication among bioinformatics scientists at the state and national levels are crucial for addressing common challenges. The field of bioinformatics is constantly evolving, and new positions and opportunities are being created.

Ep28 Food Safety Research Network
Summary In this episode of the Bioinformatics Lab Podcast, Kevin Libuit and Andrew Page discuss the Food Safety Research Network and a recent conference they attended. The network brings together government, academia, and industry to focus on food safety research. The conference had an interactive format, encouraging networking and collaboration among participants. The network provides funding for research projects and aims to bridge the gap between academia and industry. The conference covered various topics, including the influence of social media on food safety and challenges with new cooking devices and novel foods. The future steps for the Food Safety Research Network include seeking continued funding and furthering their research efforts. Takeaways The Food Safety Research Network brings together government, academia, and industry to focus on food safety research. The conference had an interactive format, encouraging networking and collaboration among participants. The network provides funding for research projects and aims to bridge the gap between academia and industry. Topics discussed at the conference included the influence of social media on food safety and challenges with new cooking devices and novel foods.

Ep27 Science and Social Media with Emma Hodcroft
Dr. Emma Hodcroft discusses her experience with scientific communication, particularly on social media. She shares her background in infectious diseases and how she became involved in science communication. Dr. Hodcroft reflects on the influence of her father's involvement in a football club on her early appreciation of communicating with a broader audience. She discusses the challenges of balancing personal and professional information on social media and the importance of being mindful of the messages being conveyed. Dr. Hodcroft also emphasizes the need for scientists to take on the responsibility of communicating their ideas and findings to the public. The conversation explores the role of experts in times of emergency and the importance of providing accurate information. It also highlights the work of scientists behind the scenes and encourages them to share their expertise. The conversation emphasizes the need for diverse representation in the media and concludes with a discussion on personal and professional content. Takeaways Balancing personal and professional information on social media is important to maintain a focused and effective communication strategy. Engaging with trolls and negative interactions on social media is generally unproductive and it is better to ignore them or respond with kindness. Scientists have a responsibility to communicate their ideas and findings to the public, but not everyone needs to be a science communicator in the traditional sense. Training and awareness of science communication should be provided to scientists to help them effectively communicate their work to a broader audience. In times of emergency, it is important to provide good, solid, and truthful information, even if you don't consider yourself the leading expert. Highlighting the work of scientists behind the scenes can help showcase the diverse contributions to the scientific community. Encouraging scientists to share their expertise and have confidence in their knowledge can lead to more diverse representation in the media. Personal and professional content can coexist and provide a well-rounded view of an individual's interests and experiences.

Ep 26 Utility of Open-Source Software
In this episode, Kevin Libuit and Andrew Page discuss the utility of open source software in the world of public health and pathogen genomics. They highlight the importance of reproducibility, cost-effectiveness, and community collaboration that open source software offers. The conversation also touches on the different open source licenses and the challenges of maintaining and supporting open source software in the long term. They distinguish between open source workflows and closed source platforms, emphasizing the need for reproducibility in workflows and the challenges of making platforms fully open source. Overall, the discussion emphasizes the value and challenges of open source software in public health. Takeaways Open source software in public health enables reproducibility, cost-effectiveness, and community collaboration. Different open source licenses have varying levels of permissiveness and restrictions. Open source workflows allow for reproducibility and interoperability, while closed source platforms may limit accessibility. Maintaining and supporting open source software in the long term can be challenging. Efforts are being made to establish standards and assessments for open source software in public health.

Ep 25 Containerized Software
Ep 25 Containerized Software by The Bioinformatics Lab

Ep 24 Reasons to (or not to) Pursue a PhD
Ep 24 Reasons to (or not to) Pursue a PhD by The Bioinformatics Lab

Ep 23 Benefits of Cloud Computing for Bioinformatics
In this episode we discuss the shift from traditional High-Performance Computing (HPC) systems to cloud computing in bioinformatics. Key points of the conversation include: Transition to Cloud Computing: The discussion begins with the gradual adoption of cloud computing over the past 5-6 years in bioinformatics, highlighting that for many scientists, moving from an HPC infrastructure to the cloud represents a significant change. Experiences with HPC and Cloud Transition: Andrew shares his experience transitioning from a traditional HPC setup to a cloud system, describing the process and the initial challenges faced, such as adapting to different software and operational methods. Benefits of Cloud Over Traditional HPC: The hosts discuss the advantages of cloud computing, such as overcommitting CPUs for better resource utilization and the flexibility it offers compared to physical hardware limitations of HPC. Challenges in Adopting Cloud Technology: They address the initial resistance from IT departments in adopting cloud services, discussing how the transition affects IT operations and the control research groups have over resources. Cost Management and Efficiency in the Cloud: Kevin and Andrew discuss the need for careful management of cloud resources to avoid unexpected costs, such as runaway jobs or excessive logging fees. They emphasize the importance of monitoring and optimizing cloud usage to balance efficiency and cost. Choosing a Cloud Provider: The conversation shifts to the topic of selecting a cloud provider, with a discussion on the similarities and differences between major providers like AWS, GCP, and Azure. They note that while these platforms offer comparable services, specific features and pricing models might vary. Advice for Early Career Bioinformaticians: The hosts conclude by advising those new to bioinformatics to familiarize themselves with cloud computing, noting its growing importance across various fields, not just in bioinformatics. The episode provides a comprehensive overview of the transition from traditional HPC to cloud computing in the bioinformatics field, along with practical advice and insights into managing and optimizing cloud resources.