Nach Genre filtern

Data Skeptic

Kyle Polich

The Data Skeptic Podcast features interviews and discussion of topics related to data science, statistics, machine learning, artificial intelligence and the like, all from the perspective of applying critical thinking and the scientific method to evaluate the veracity of claims and efficacy of approaches.

554 - Graphs for HPC and LLMs

0:00 / 0:00

554 - Graphs for HPC and LLMs
We are joined by Maciej Besta, a senior researcher of sparse graph computations and large language models at the Scalable Parallel Computing Lab (SPCL). In this episode, we explore the intersection of graph theory and high-performance computing (HPC), Graph Neural Networks (GNNs) and LLMs.
Tue, 29 Oct 2024 - 52min
553 - Graph Databases and AI
In this episode, we sit down with Yuanyuan Tian, a principal scientist manager at Microsoft Gray Systems Lab, to discuss the evolving role of graph databases in various industries such as fraud detection in finance and insurance, security, healthcare, and supply chain optimization.
Mon, 21 Oct 2024 - 35min
552 - Network Analysis in Practice
Our new season "Graphs and Networks" begins here! We are joined by new co-host Asaf Shapira, a network analysis consultant and the podcaster of NETfrix – the network science podcast. Kyle and Asaf discuss ideas to cover in the season and explore Asaf's work in the field.
Mon, 14 Oct 2024 - 29min
551 - Animal Intelligence Final Exam
Join us for our capstone episode on the Animal Intelligence season. We recap what we loved, what we learned, and things we wish we had gotten to spend more time on. This is a great episode to see how the podcast is produced. Now that the season is ending, our current co-host, Becky, is moving to emeritus status. In this last installment we got to spend a little more time getting to know Becky and where her work will take her after this. Did Data Skeptic inspire her to learn more about machine learning? Tune in and find out.
Mon, 07 Oct 2024 - 30min
550 - Process Mining with LLMs
David Obembe, a recent University of Tartu graduate, discussed his Masters thesis on integrating LLMs with process mining tools. He explained how process mining uses event logs to create maps that identify inefficiencies in business processes. David shared his research on LLMs' potential to enhance process mining, including experiments evaluating their performance and future improvements using Retrieval Augmented Generation (RAG).
Tue, 24 Sep 2024 - 26min
549 - Open Animal Tracks
Our guest today is Risa Shinoda, a PhD student at Kyoto University Agricultural Systems Engineering Lab, where she applies computer vision techniques.
She talked about the OpenAnimalTracks dataset and what it was used for. The dataset helps researchers predict animal footprint. She also discussed how she built a model for predicting tracks of animals. She shared the algorithms used and the accuracy they achieved. She also discussed further improvement opportunities for the model.
Tue, 17 Sep 2024 - 22min
548 - Bird Distribution Modeling with Satbird
This episode features an interview with Mélisande Teng, a PhD candidate at Université de Montréal. Her research lies in the intersection of remote sensing and computer vision for biodiversity monitoring.
Tue, 10 Sep 2024 - 39min
547 - Ant Encounters
In this interview with author Deborah Gordon, Kyle asks questions about the mechanisms at work in an ant colony and what ants might teach us about how to build artificial intelligence. Ants are surprisingly adaptive creatures whose behavior emerges from their complex interactions. Aspects of network theory and the statistical nature of ant behavior are just some of the interesting details you'll get in this episode.

Mon, 26 Aug 2024 - 31min
546 - Computing Toolbox
This season it’s become clear that computing skills are vital for working in the natural sciences. In this episode, we were fortunate to speak with Madlen Wilmes, co-author of the book "Computing Skills for Biologists: A Toolbox". We discussed the book and why it’s a great resource for students and teachers. In addition to the book, Madlen shared her experience and advice on transitioning from academia to an industry career and how data analytic skills transfer to jobs that your professionals might not always consider. Join us and learn more about the book and careers using transferable skills.
Mon, 19 Aug 2024 - 38min
545 - Biodiversity Monitoring
In this episode, we talked shop with Hager Radi about her biodiversity monitoring work. While biodiversity modeling may sound simple, count organisms and mark their location, there is a lot more to it than that! Incomplete and biased data can make estimations hard. There are also many species with very few observations in the wild. Using machine learning and remote sensing data, scientists can build models that predict species distributions with limited data. Listen in and hear about Hager’s work tackling these challenges and the tools she has built.
Wed, 14 Aug 2024 - 32min
544 - Hacking the Colony
Today, Ashay Aswale and Tony Lopez shared their work on swarm robotics and what they have learned from ants. Robotic swarms must solve the same problems that eusocial insects do. What if your pheromone trail goes cold? What if you’re getting bad information from a bad-actor within the swarm? Answering these questions can help tackle serious robotic challenges. For example, a swarm of robots can lose a few members to accidents and malfunctions, but a large robot cannot. Additionally, a swarm could be host to many castes like an ant colony. Specialization with redundancy built in seems like a win-win! Tune in and hear more about this fascinating topic.
Thu, 08 Aug 2024 - 41min
543 - Primate Poses
During this season we have talked with researchers working to utilize machine learning for behavioral observations. In previous episodes, you have heard about the software people like Richard use, but you haven’t heard much from scientists modifying and using these tools for specific research cases. PhD student, Richard Vogg, is working with multi-camera set-ups to track lemurs and macaques solving puzzle boxes in the wild. His work is part of a larger movement to automate behavioral analyses of video data. Listen in and learn why this tech is useful and why multi-camera setups are a good idea for more reliably identifying poses and individual animals.
Wed, 31 Jul 2024 - 32min
542 - Generating 3D Animals with YouDream
Generative AI can struggle to create realistic animals and 2D representations often have mistakes like extra limbs and tails. If 2D wasn’t hard enough, there are researchers working on generative 3D models. 3D models present an extra challenge because there is paucity of training datasets.In this episode, PhD students Sandeep and Oindrila walked us through their work on creating 3D animals using 2D data. Join us to learn about their pipelines, quality control, tie in with iNaturalist, and how this tech could streamline FX pipelines.
Tue, 23 Jul 2024 - 1h 00min
541 - Weird Communication
Today, we sat down with Dr. Ignacio Escalante Meza to learn about opiliones and treehoppers. Opiliones, known as “daddy long legs” in the US, are understudied arachnids known for their tenacious locomotor behavior, sociality, and chemical communication. Treehoppers communicate through the stems of plants using vibrations. They can signal danger, attract mates, and communicate with their offspring. Join us to learn how researchers turn their vibrations into sound waves and study what they have to say.
Mon, 15 Jul 2024 - 38min
540 - Reducing the Impact of Ship Noise on Marine Mammals
Human shipping operations have increased significantly in the past few decades. While that means international trade and cheap goods for humans, it also means the ocean has experienced an increase in noise pollution. This has a measurable negative impact on marine mammals and other aquatic life. Could mathematics be the solution? This interview explores how optimization techniques can guide voyage optimization in a way that handles multiple optimization objectives including fuel cost and sound reduction.
Mon, 01 Jul 2024 - 36min
539 - Analysis of Unstructured Data
Robbie Moon from the Georgia Tech Scheller College of Business joins us to discuss the analysis of unstructured data and the application of NLP methodologies towards financial data.
Fri, 28 Jun 2024 - 27min
538 - iNaturalist
Have you ever participated in citizen science? Do you want to? One of the most popular platforms for crowdsourcing biodiversity data is iNaturalist. In addition to being a great science tool, the iNaturalist app can help you identify the organisms you encounter every day. We talked to Executive Director Scott Laurie about how scientists use iNaturalist. We also got to discuss what makes iNaturalist’s AI species recognition so good, and how citizen scientists are constantly providing high-quality training data. Listen in and learn how this fun-to-use tool works, where it's headed, and how you can get involved.
Mon, 24 Jun 2024 - 37min
537 - Learn to Code
Do you code or are you interested in learning to code? Join us today and hear from three individuals that are at very different stages of their coding journeys. Becky Hansis-O’Neill (also our co-host this season) shares her experiences as a newbie who wants to learn more. Dr. Malia Gehan, a self-taught developer interested in studying plant phenotypes, explains why and how she and her colleagues learned to code and developed PlantCV. Finally, Dr. John Wilmes discusses his work as a professional mathematician and Machine Learning Research Engineer. Whether you are thinking about learning to code or an expert, we’re sure you will see a bit of yourself in this episode.
Tue, 18 Jun 2024 - 49min
536 - Animal Computer Interaction
You’ve heard of Human Computer Interaction (HCI), now get ready for Animal Computer Interaction (ACI). Ilyena has made a career developing computer interfaces for non-human animals. She has worked with dogs, parrots, primates, and even giraffes. This is challenging because animals have a wide range of abilities and preferences. Parrots, for example, use their tongues to make selections on touchscreens. Listen in on our conversation and learn about interface development and testing with animals and how technology may improve animal welfare.
Mon, 10 Jun 2024 - 42min
535 - Ape Gestures
Cat observes great apes in the wild and in the lab to crack the code of their gestural communication. We discussed the challenges and benefits of studying apes in the wild vs in the lab. Cat also shared how her lab identifies and studies ape gestures. It turns out that humans are pretty good at guessing what apes are trying to communicate with one another. Join us in this episode to learn more about the evolution of communication in great apes, and what we can learn from our closest relatives.
Mon, 03 Jun 2024 - 49min
534 - Evaluating AI Abilities
In this episode, Kozzy discusses his endeavors to compare the cognitive abilities of humans, animals, and AI programs. Specifically, we discussed object permanence, the ability to understand an object still exists in space even when you can’t see it. Our conversation traverses both philosophical and practical questions surrounding AI evaluation. We also learned about Animal AI 3, a gaming environment developed in Unity where AI programs and humans can go head-to-head to solve different problems in a gaming environment.
Mon, 27 May 2024 - 49min
533 - HMMs for Behavior
Théo Michelot has made a career out of tackling tough ecological questions using time-series data. How do scientists turn a series of GPS location observations over time into useful behavioral data? GPS tech has improved to the point that modern data sets are large and complex. In this episode, Théo takes us through his research and the application of Hidden Markov Models to complex time series data. If you have ever wondered what biologists do with data from those GPS collars you have seen on TV, this is the episode for you!
Mon, 20 May 2024 - 45min
532 - Bioinspired Engineering
Brian Taylor shares his research on magnetoreception. Animals like birds and sea turtles use magnetoreception to use the Earth’s magnetic field for navigation, but it’s not a sense that’s well understood. Brian uses animal magnetoreception to engineer new ways to navigate the globe. Even cooler, he also takes hypotheses for how magnetoreception works in animals and uses computational simulations to digitally test them. Check out this episode to hear more about Brian’s research and learn more about this little known sensory ability.
Tue, 14 May 2024 - 38min
531 - Modelling Evolution
Modeling evolutionary processes goes way beyond the Hardy-Weinberg Equilibrium we all learned in biology class. Natural selection comes from many sources like resources availability, mate preferences, competition. Modeling entire populations of organisms of different species is the holy grail of digital evolution. Join our discussion with evolutionary biologist and software engineer Ben Haller to learn about his work on SLiM and how it helps other biologists model population genetics over time.
Thu, 09 May 2024 - 41min
530 - Behavioral Genetics
It’s almost impossible to think about animal behavior without thinking of dogs! Our canine friends are a subspecies of wolf that has been co-evolving with us for tens of thousands of years. The transition from wolf to pet has required intense natural and artificial selection for behaviors that allow dogs to live alongside humans, but behavior is not so simple. Join us for a discussion with Dr. Jessica Hekman and learn about dog welfare, behavioral genetics, and the quest to understand the dogs in our lives.
Tue, 30 Apr 2024 - 47min
529 - Signal in the Noise
In this episode, we are joined by Barbara Webb and Anna Hadjitofi. Barbara runs the Insect Robotics lab at the University of Edinburgh, and Anna is a PhD student at the School of Informatics at the university. She is interested in studying and understanding the neural mechanism of the honeybee waggle dance. They join us to discuss the paper: Dynamic antennal positioning allows honeybee followers to decode the dance.
Thu, 25 Apr 2024 - 41min
528 - Pose Tracking
Many researchers and students have painstakingly labeled precise details about the body positions of the creatures they study. Can AI be used for this labeling? Of course it can! Today's episode discusses Social LEAP Estimates Animal Poses (SLEAP), a software solution to train AI to perform this tedious but important labeling work.
Tue, 16 Apr 2024 - 50min
527 - Modeling Group Behavior
Our guest in this episode is Sebastien Motsch, an assistant professor at Arizona State University, working in the School of Mathematical and Statistical Science. He works on modeling self-organized biological systems to understand how complex patterns emerge.
Mon, 08 Apr 2024 - 40min
526 - Advances in Data Loggers
Our guest in this episode is Ryan Hanscom. Ryan is a Ph.D. candidate in a joint doctoral evolution program at San Diego State University and the University of California, Riverside. He is a terrestrial ecologist with a focus on herpetology and mammalogy. Ryan discussed how the behavior of rattlesnakes is studied in the natural world, particularly with an increase in temperature.
Mon, 25 Mar 2024 - 35min
525 - What You Know About Intelligence is Wrong (fixed)
We are joined by Hank Schlinger, a professor of psychology at California State University, Los Angeles. His research revolves around theoretical issues in psychology and behavioral analysis. Hank establishes that words have references and questions the reference for intelligence. He discussed how intelligence can be observed in animals. He also discussed how intelligence is measured in a given context.
Wed, 20 Mar 2024 - 41min
523 - Animal Decision Making
On today’s episode, we are joined by Aimee Dunlap. Aimee is an assistant professor at the University of Missouri–St. Louis and the interim director at the Whitney R. Harris World Ecology Center.
Aimee discussed how animals perceive information and what they use it for. She discussed the connection between their environment and learning for decision-making. She also discussed the costs required for learning and factors that affect animal learning.
Tue, 12 Mar 2024 - 37min
522 - Octopus Cognition
We are joined by Tamar Gutnick, a visiting professor at the University of Naples Federico II, Napoli, Italy. She studies the octopus nervous system and their behavior, focusing on cognition and learning behaviors.
Tamar gave a background to the kind of research she does — lab research. She discussed some challenges with observing octopuses in the lab. She discussed some patterns observed by the octopus lifestyle in a controlled setting.
Tamar discussed what they know about octopus intelligence. She discussed the octopus nervous system and why they are unique compared to other animals. She discussed how they measure the behavior of octopuses using a video recording and a logger to track brain activity.
Fri, 08 Mar 2024 - 38min
521 - Optimal Foraging
Claire Hemmingway, an assistant professor in the Department of Psychology and Ecology and Evolutionary Biology at the University of Tennessee in Knoxville, is our guest today. Her research is on decision-making in animal cognition, focusing on neotropical bats and bumblebees.
Claire discussed how bumblebees make foraging decisions and how they communicate when foraging. She discussed how they set up experiments in the lab to address questions about bumblebees foraging. She also discussed some nuances between bees in the lab and those in the wild.
Claire discussed factors that drive an animal's foraging decisions. She explained the foraging theory and how a colony works together to optimize its foraging. She also touched on some irrational foraging behaviors she observed in her study.
Claire discussed some techniques bees use to learn from past behaviors. She discussed the effect of climate change on foraging bees' learning behavior.
Claire discussed how bats respond to calling frogs when foraging. She also spoke about choice overload in that they make detrimental decisions when loaded with too many options.
Wed, 28 Feb 2024 - 38min
520 - Memory in Chess
On today’s show, we are joined by our co-host, Becky Hansis-O’Neil. Becky is a Ph.D. student at the University of Missouri, St Louis, where she studies bumblebees and tarantulas to understand their learning and cognitive work.

She joins us to discuss the paper: Perception in Chess. The paper aimed to understand how chess players perceive the positions of chess pieces on a chess board. She discussed the findings paper. She spoke about situations where grandmasters had better recall of chess positions than beginners and situations where they did not.

Becky and Kyle discussed the use of chess engines for cheating. They also discussed how chess players use chunking. Becky discussed some approaches to studying chess cognition, including eye tracking, EEG, and MRI.
## Paper in Focus
Perception in chess
## Resources
Detecting Cheating in Chess with Ken Regan
Mon, 12 Feb 2024 - 48min
519 - OpenWorm
On this episode, we are joined by Stephen Larson, the CEO of MetaCell and an affiliate of the OpenWorm foundation. Stephen discussed what the Openworm project is about. They hope to use a digital C. elegans nematode (C. elegans for short) to study the basics of life.
Stephen discussed why C. elegans is an ideal organism for studying life in the lab. He also discussed the steps involved in simulating a digital organism. He mentioned the constraints on the cellular scale that informed their development of a digital C. elegans.
Stephen discussed the validation process of the simulation. He discussed how they discovered the best parameters to capture the behavior of natural C. elegans. He also discussed how biologists embraced the project.
Stephen discussed the computational requirements for improving the simulation parameters of the model and the kind of data they require to scale up. Stephen discussed some findings that the machine-learning communities can take away from the project. He also mentioned how students can get involved in the Openworm project. Rounding up, he shared future plans for the project.
Mon, 05 Feb 2024 - 34min
518 - What the Antlion Knows
Our guest is Becky Hansis-O’Neil, a Ph.D. student at the University of Missouri, St Louis, and our co-host for the new "Animal Intelligence" season. Becky shares her background on how she got into the field of behavioral intelligence and biology.
Tue, 30 Jan 2024 - 41min
517 - AI Roundtable
Kyle is joined by friends and former guests Pramit Choudhary and Frank Bell to have an open discussion of the impacts LLMs and machine learning have had in the past year on industry, and where things may go in the current year.
Wed, 17 Jan 2024 - 50min
516 - Uncontrollable AI Risks
We are joined by Darren McKee, a Policy Advisor and the host of Reality Check — a critical thinking podcast. Darren gave a background about himself and how he got into the AI space.
Darren shared his thoughts on AGI's achievements in the coming years. He defined AGI and discussed how to differentiate an AGI system. He also shared whether AI needs consciousness to be AGI.
Darren discussed his worry about AI surpassing human understanding of the universe and potentially causing harm to humanity. He also shared examples of how AI is already used for nefarious purposes. He explored whether AI possesses inherently evil intentions and gave his thoughts on regulating AI.
Wed, 27 Dec 2023 - 38min
515 - I LLM and You Can Too
It took a massive financial investment for the first large language models (LLMs) to be created. Did their corporate backers lock these tools away for all but the richest? No. They provided comodity priced API options for using them. Anyone can talk to Chat GPT or Bing. What if you want to go a step beyond that and do something programatic? Kyle explores your options in this episode.
Sat, 23 Dec 2023 - 23min
514 - Q&A with Kyle
We celebrate episode 1000000000 with some Q&A from host Kyle Polich. We boil this episode down to four key questions:
1) How do you find guests
2) What is Data Skeptic all about?
3) What is Kyle all about?
4) What are Kyle's thoughts on AGI?

Thanks to our sponsorsdataannotation.tech/programmers https://www.webai.com/dataskeptic

Tue, 19 Dec 2023 - 40min
513 - LLMs for Data Analysis
In this episode, we are joined by Amir Netz, a Technical Fellow at Microsoft and the CTO of Microsoft Fabric. He discusses how companies can use Microsoft's latest tools for business intelligence.
Amir started by discussing how business intelligence has progressed in relevance over the years. Amir gave a brief introduction into what Power BI and Fabric are. He also discussed how Fabric distinguishes from other BI tools by building an end-to-end tool for the data journey.
Amir spoke about the process of building and deploying machine learning models with Microsoft Fabric. He shared the difference between Software as a Service (SaaS) and Platform as a Service (PaaS).
Amir discussed the benefits of Fabric's auto-integration and auto-optimization abilities. He also discussed the capabilities of Copilot in Fabric. He also discussed exciting future developments planned for Fabric. Amir shared techniques for limiting Copilot hallucination.
Tue, 12 Dec 2023 - 29min
512 - AI Platforms
Our guest today is Eric Boyd, the Corporate Vice President of AI at Microsoft. Eric joins us to share how organizations can leverage AI for faster development.
Eric shared the benefits of using natural language to build products. He discussed the future of version control and the level of AI background required to get started with Azure AI. He mentioned some foundational models in Azure AI and their capabilities. Follow Eric on LinkedIn to learn more about his work.
Visit today's sponsor at https://webai.com/dataskeptic
Mon, 04 Dec 2023 - 33min
511 - Deploying LLMs
We are excited to be joined by Aaron Reich and Priyanka Shah. Aaron is the CTO at Avanade, while Priyanka leads their AI/IoT offering for the SEA Region. Priyanka is also the MVP for Microsoft AI. They join us to discuss how LLMs are deployed in organizations.
Mon, 27 Nov 2023 - 35min
510 - A Survey Assessing Github Copilot
In this episode, we are joined by Jenny Liang, a PhD student at Carnegie Mellon University, where she studies the usability of code generation tools. She discusses her recent survey on the usability of AI programming assistants.
Jenny discussed the method she used to gather people to complete her survey. She also shared some questions in her survey alongside vital takeaways. She shared the major reasons for developers not wanting to us code-generation tools. She stressed that the code-generation tools might access the software developers' in-house code, which is intellectual property.
Learn more about Jenny Liang via https://jennyliang.me/

Mon, 20 Nov 2023 - 26min
509 - Program Aided Language Models
We are joined by Aman Madaan and Shuyan Zhou. They are both PhD students at the Language Technology Institute at Carnegie Mellon University. They join us to discuss their latest published paper, PAL: Program-aided Language Models.
Aman and Shuyan started by sharing how the application of LLMs has evolved. They talked about the performance of LLMs on arithmetic tasks in contrast to coding tasks. Aman introduced their PAL model and how it helps LLMs improve at arithmetic tasks. He shared examples of the tasks PAL was tested on. Shuyan discussed how PAL’s performance was evaluated using Big Bench hard tasks.
They discussed the kind of mistakes LLMs tend to make and how the PAL’s model circumvents these limitations. They also discussed how these developments in LLMS can improve kids learning.
Rounding up, Aman discussed the CoCoGen project, a project that enables NLP tasks to be converted to graphs. Shuyan and Aman shared their next research steps.
Follow Shuyan on Twitter @shuyanzhxyc. Follow Aman on @aman_madaan.
Mon, 13 Nov 2023 - 32min
508 - Which Programming Language is ChatGPT Best At
In this episode, we have Alessio Buscemi, a software engineer at Lifeware SA. Alessio was a post-doctoral researcher at the University of Luxembourg. He joins us to discuss his paper, A Comparative Study of Code Generation using ChatGPT 3.5 across 10 Programming Languages. Alessio shared his thoughts on whether ChatGPT is a threat to software engineers. He discussed how LLMs can help software engineers become more efficient.
Mon, 06 Nov 2023 - 40min
507 - GraphText
On the show today, we are joined by Jianan Zhao, a Computer Science student at Mila and the University of Montreal. His research focus is on graph databases and natural language processing. He joins us to discuss how to use graphs with LLMs efficiently.

Tue, 31 Oct 2023 - 30min
506 - arXiv Publication Patterns
Today, we are joined by Rajiv Movva, a PhD student in Computer Science at Cornell Tech University. His research interest lies in the intersection of responsible AI and computational social science. He joins to discuss the findings of this work that analyzed LLM publication patterns.
He shared the dataset he used for the survey. He also discussed the conditions for determining the papers to analyze. Rajiv shared some of the trends he observed from his analysis. For one, he observed there has been an increase in LLMs research. He also shared the proportions of papers published by universities, organizations, and industry leaders in LLMs such as OpenAI and Google. He mentioned the majority of the papers are centered on the social impact of LLMs. He also discussed other exciting application of LLMs such as in education.
Mon, 23 Oct 2023 - 28min
505 - Do LLMs Make Ethical Choices
We are excited to be joined by Josh Albrecht, the CTO of Imbue. Imbue is a research company whose mission is to create AI agents that are more robust, safer, and easier to use. He joins us to share findings of his work; Despite "super-human" performance, current LLMs are unsuited for decisions about ethics and safety.

Mon, 16 Oct 2023 - 29min
504 - Emergent Deception in LLMs
On today’s show, we are joined by Thilo Hagendorff, a Research Group Leader of Ethics of Generative AI at the University of Stuttgart. He joins us to discuss his research, Deception Abilities Emerged in Large Language Models.
Thilo discussed how machine psychology is useful in machine learning tasks. He shared examples of cognitive tasks that LLMs have improved at solving. He shared his thoughts on whether there’s a ceiling to the tasks ML can solve.
Mon, 09 Oct 2023 - 27min
503 - Agents with Theory of Mind Play Hanabi
Nieves Montes, a Ph.D. student at the Artificial Intelligence Research Institute in Barcelona, Spain, joins us. Her PhD research revolves around value-based reasoning in relation to norms. She shares her latest study, Combining theory of mind and abductive reasoning in agent‑oriented programming.
Mon, 02 Oct 2023 - 38min
502 - LLMs for Evil
We are joined by Maximilian Mozes, a PhD student at the University College, London. His PhD research focuses on Natural Language Processing (NLP), particularly the intersection of adversarial machine learning and NLP. He joins us to discuss his latest research, Use of LLMs for Illicit Purposes: Threats, Prevention Measures, and Vulnerabilities.
Mon, 25 Sep 2023 - 26min
501 - The Defeat of the Winograd Schema Challenge
Our guest today is Vid Kocijan, a Machine Learning Engineer at Kumo AI. Vid has a Ph.D. in Computer Science at the University of Oxford. His research focused on common sense reasoning, pre-training in LLMs, pretraining in knowledge-based completion, and how these pre-trainings impact societal bias. He joins us to discuss how he built a BERT model that solved the Winograd Schema Challenge.
Mon, 11 Sep 2023 - 31min
500 - LLMs in Social Science
Today, We are joined by Petter Törnberg, an Assistant Professor in Computational Social Science at the University of Amsterdam and a Senior Researcher at the University of Neuchatel. His research is centered on the intersection of computational methods and their applications in social sciences. He joins us to discuss findings from his research papers, ChatGPT-4 Outperforms Experts and Crowd Workers in Annotating Political Twitter Messages with Zero-Shot Learning, and How to use LLMs for Text Analysis.
Mon, 04 Sep 2023 - 34min
499 - LLMs in Music Composition
In this episode, we are joined by Carlos Hernández Oliván, a Ph.D. student at the University of Zaragoza. Carlos’s interest focuses on building new models for symbolic music generation.
Carlos shared his thoughts on whether these models are genuinely creative. He revealed situations where AI-generated music can pass the Turing test. He also shared some essential considerations when constructing models for music composition.
Mon, 28 Aug 2023 - 33min
498 - Cuttlefish Model Tuning
Hongyi Wang, a Senior Researcher at the Machine Learning Department at Carnegie Mellon University, joins us. His research is in the intersection of systems and machine learning. He discussed his research paper, Cuttlefish: Low-Rank Model Training without All the Tuning, on today’s show.
Hogyi started by sharing his thoughts on whether developers need to learn how to fine-tune models. He then spoke about the need to optimize the training of ML models, especially as these models grow bigger. He discussed how data centers have the hardware to train these large models but not the community. He then spoke about the Low-Rank Adaptation (LoRa) technique and where it is used.
Hongyi discussed the Cuttlefish model and how it edges LoRa. He shared the use cases of Cattlefish and who should use it. Rounding up, he gave his advice on how people can get into the machine learning field. He also shared his future research ideas.
Mon, 21 Aug 2023 - 27min
497 - Which Professions Are Threatened by LLMs
On today’s episode, we have Daniel Rock, an Assistant Professor of Operations Information and Decisions at the Wharton School of the University of Pennsylvania. Daniel’s research focuses on the economics of AI and ML, specifically how digital technologies are changing the economy.
Daniel discussed how AI has disrupted the job market in the past years. He also explained that it had created more winners than losers.
Daniel spoke about the empirical study he and his coauthors did to quantify the threat LLMs pose to professionals. He shared how they used the O-NET dataset and the BLS occupational employment survey to measure the impact of LLMs on different professions. Using the radiology profession as an example, he listed tasks that LLMs could assume.
Daniel broadly highlighted professions that are most and least exposed to LLMs proliferation. He also spoke about the risks of LLMs and his thoughts on implementing policies for regulating LLMs.
Tue, 15 Aug 2023 - 38min
496 - Why Prompting is Hard
We are excited to be joined by J.D. Zamfirescu-Pereira, a Ph.D. student at UC Berkeley. He focuses on the intersection of human-computer interaction (HCI) and artificial intelligence (AI). He joins us to share his work in his paper, Why Johnny can’t prompt: how non-AI experts try (and fail) to design LLM prompts. The discussion also explores lessons learned and achievements related to BotDesigner, a tool for creating chat bots.
Tue, 08 Aug 2023 - 48min
495 - Automated Peer Review
In this episode, we are joined by Ryan Liu, a Computer Science graduate of Carnegie Mellon University. Ryan will begin his Ph.D. program at Princeton University this fall. His Ph.D. will focus on the intersection of large language models and how humans think. Ryan joins us to discuss his research titled "ReviewerGPT? An Exploratory Study on Using Large Language Models for Paper Reviewing"
Mon, 31 Jul 2023 - 36min
494 - Prompt Refusal
The creators of large language models impose restrictions on some of the types of requests one might make of them. LLMs commonly refuse to give advice on committing crimes, producting adult content, or respond with any details about a variety of sensitive subjects. As with any content filtering system, you have false positives and false negatives.
Today's interview with Max Reuter and William Schulze discusses their paper "I'm Afraid I Can't Do That: Predicting Prompt Refusal in Black-Box Generative Language Models". In this work, they explore what types of prompts get refused and build a machine learning classifier adept at predicting if a particular prompt will be refused or not.
Mon, 24 Jul 2023 - 44min
493 - A Long Way Till AGI
Our guest today is Maciej Świechowski. Maciej is affiliated with QED Software and QED Games. He has a Ph.D. in Systems Research from the Polish Academy of Sciences. Maciej joins us to discuss findings from his study, Deep Learning and Artificial General Intelligence: Still a Long Way to Go.
Tue, 18 Jul 2023 - 37min
492 - Brain Inspired AI
Today on the show, we are joined by Lin Zhao and Lu Zhang. Lin is a Senior Research Scientist at United Imaging Intelligence, while Lu is a Ph.D. candidate at the Department of Computer Science and Engineering at the University of Texas. They both shared findings from their work When Brain-inspired AI Meets AGI.
Lin and Lu began by discussing the connections between the brain and neural networks. They mentioned the similarities as well as the differences. They also shared whether there is a possibility for solid advancements in neural networks to the point of AGI. They shared how understanding the brain more can help drive robust artificial intelligence systems.
Lin and Lu shared how the brain inspired popular machine learning algorithms like transformers. They also shared how AI models can learn alignment from the human brain. They juxtaposed the low energy usage of the brain compared to high-end computers and whether computers can become more energy efficient.
Tue, 11 Jul 2023 - 36min
491 - Computable AGI
On today’s show, we are joined by Michael Timothy Bennett, a Ph.D. student at the Australian National University. Michael’s research is centered around Artificial General Intelligence (AGI), specifically the mathematical formalism of AGIs. He joins us to discuss findings from his study, Computable Artificial General Intelligence.
Mon, 03 Jul 2023 - 36min
490 - AGI Can Be Safe
We are joined by Koen Holtman, an independent AI researcher focusing on AI safety. Koen is the Founder of Holtman Systems Research, a research company based in the Netherlands.
Koen started the conversation with his take on an AI apocalypse in the coming years. He discussed the obedience problem with AI models and the safe form of obedience.
Koen explained the concept of Markov Decision Process (MDP) and how it is used to build machine learning models.
Koen spoke about the problem of AGIs not being able to allow changing their utility function after the model is deployed. He shared another alternative approach to solving the problem. He shared how to engineer AGI systems now and in the future safely. He also spoke about how to implement safety layers on AI models.
Koen discussed the ultimate goal of a safe AI system and how to check that an AI system is indeed safe. He discussed the intersection between large language Models (LLMs) and MDPs. He shared the key ingredients to scale the current AI implementations.
Mon, 26 Jun 2023 - 45min
489 - AI Fails on Theory of Mind Tasks
An assistant professor of Psychology at Harvard University, Tomer Ullman, joins us. Tomer discussed the theory of mind and whether machines can indeed pass it. Using variations of the Sally-Anne test and the Smarties tube test, he explained how LLMs could fail the theory of mind test.
Mon, 19 Jun 2023 - 52min
488 - AI for Mathematics Education
The application of LLMs cuts across various industries. Today, we are joined by Steven Van Vaerenbergh, who discussed the application of AI in mathematics education. He discussed how AI tools have changed the landscape of solving mathematical problems. He also shared LLMs' current strengths and weaknesses in solving math problems.
Mon, 12 Jun 2023 - 35min
487 - Evaluating Jokes with LLMs
Fabricio Goes, a Lecturer in Creative Computing at the University of Leicester, joins us today. Fabricio discussed what creativity entails and how to evaluate jokes with LLMs. He specifically shared the process of evaluating jokes with GPT-3 and GPT-4. He concluded with his thoughts on the future of LLMs for creative tasks.
Tue, 06 Jun 2023 - 43min
486 - Why Machines Will Never Rule the World
Barry Smith and Jobst Landgrebe, authors of the book “Why Machines will never Rule the World,” join us today. They discussed the limitations of AI systems in today’s world. They also shared elaborate reasons AI will struggle to attain the level of human intelligence.
Mon, 29 May 2023 - 55min
485 - A Psychopathological Approach to Safety in AGI
While the possibilities with AGI emergence seem great, it also calls for safety concerns. On the show, Vahid Behzadan, an Assistant Professor of Computer Science and Data Science, joins us to discuss the complexities of modeling AGIs to accurately achieve objective functions. He touched on tangent issues such as abstractions during training, the problem of unpredictability, communications among agents, and so on.
Tue, 23 May 2023 - 49min
484 - The NLP Community Metasurvey
Julian Michael, a postdoc at the Center for Data Science, New York University, joins us today. Julian’s conversation with Kyle was centered on the NLP community metasurvey: a survey aimed at understanding expert opinions on controversial NLP issues. He shared the process of preparing the survey as well as some shocking results.
Mon, 15 May 2023 - 49min
483 - Skeptical Survey Interpretation
Kyle shares his own perspectives on challenges getting insight from surveys. The discussion ranges from commentary on the market research industry to specific advice for detecting disingenuous or fraudulent responses and filtering them from your analysis. Finally, he shares some quick thoughts on the usage of the Chi-Square test for interpreting cross tab results in survey analysis.

Wed, 10 May 2023 - 21min
482 - The Gallup Poll
Jeff Jones, a Senior Editor at Gallup, joins us today. His conversation with Kyle spanned a range of topics on Gallup’s poll creation process. He discussed how Gallup generates unbiased questionnaires, gets respondents, analyzes results, and everything in between.
Mon, 01 May 2023 - 40min
481 - Inclusive Study Group Formation at Scale
Gireeja Ranade, a University of California at Berkeley professor, speaks with us today. She presented her study on implementing inclusive study groups at scale and shared the observed student performance improvements after the intervention.
Tue, 25 Apr 2023 - 32min
480 - The PhilPapers Survey
Today, we are joined by David Bourget. David is an Associate Professor in Philosophy at Western University in London, Ontario. David is also the co-director of the PhilPapers Foundation and Director of the Center for Digital Philosophy. He joins us to discuss the PhilPapers Survey project.
The PhilPapers survey was initially taken in 2009, but there was a follow-up survey in 2020. David discussed the need for the subsequent survey and what changed. He mentioned the metric for measuring the opinion changes between the 2009 and 2020 surveys. He also shared future plans for the PhilPapers surveys.
Fri, 21 Apr 2023 - 31min
479 - Non-Response Bias
Today’s show focused on an essential part of surveys — missing values. This is typically caused by a low response rate or non-response from respondents. Yajuan Si is a Research Associate Professor at the Survey Research Center at the University of Michigan. She joins us to discuss dealing with bias from low survey response rates.
Mon, 10 Apr 2023 - 35min
478 - Measuring Trust in Robots with Likert Scales
We are joined by two guests today, Mariah, a Ph.D. student in the CORE Robotics Lab at Georgia Tech, and Matthew Gombolay, the Director of the CORE Robotics Lab. They both discuss practices for measuring a respondent’s perception in a survey.
Mon, 03 Apr 2023 - 47min
477 - CAREER Prediction
Ever wondered what your next career would be? Today, Keyon Vafa, a computer science Ph.D. student at Columbia University, joins us to discuss his latest research on developing a machine-learning model for career prediction. Keyon extensively spoke about how the model was developed and the possibilities it brings.
Mon, 27 Mar 2023 - 40min
476 - The Panel Study of Income Dynamics
Noura Insolera, a Research Investigator with the Panel Study of Income Dynamics (PSID), joins us to share how PSID conducts longitudinal household surveys. She also shared some interesting findings from their data exploration, particularly on the observation and trends in food insecurity.
Tue, 21 Mar 2023 - 34min
475 - Survey Design Working Session
Susan Gerbic joins Kyle to review some of the surveys Data Skeptic has launch, draft a new survey about podcast listening habits, and then review the results of that survey. You can see those results at the link below.
https://survey.dataskeptic.com/survey/result/1675102237053
Watch the videos Susan mentioned on her Youtube page at the link below.
https://www.youtube.com/playlist?list=PL7VAuaQDhPTVaLeI1IcpYph5lH19xA1u4
Tue, 14 Mar 2023 - 1h 01min
474 - Bot Detection and Dyadic Surveys
The use of social bots to fill out online surveys is becoming prevalent. Today, we speak with Sara Bybee, a postdoctoral research scholar at the University of Utah. Sara shares from her research, how she detected social bots, the strategies to curb them, and how underrepresented groups can be more represented in surveys.
Mon, 06 Mar 2023 - 35min
473 - Reproducible ESP Testing
Our guest today is Zoltán Kekecs, a Ph.D. holder in Behavioural Science. Zoltán highlights the problem of low replicability in journal papers and illustrates how researchers can better ensure complete replication of their research and findings. He used Bem’s experiment as an example, extensively talking about his methodology and results.
Mon, 20 Feb 2023 - 47min
472 - A Survey of Data Science Methodologies
On the show, Iñigo Martinez, a Ph.D. student at the University of Navarra shares his survey results which investigated how data practitioners perform data science projects. He revealed the methodologies typically used by data practitioners and the success factors in data science projects.
Mon, 13 Feb 2023 - 24min
471 - Opinion Dynamics Models
On the show today, Dino Carpentras, a post-doctoral researcher at the Computational Social Science group at ETH Zürich joins us to discuss how opinion dynamics models are built and validated. He explained how quantifying opinions is complex, and strategies to develop robust models for measuring and predicting public opinions.
Mon, 06 Feb 2023 - 35min
470 - Casual Affective Triggers
Crafting survey questions is one thing but getting your audience to fill it is yet another. On the show today, we speak with Alexander Nolte, an Associate Professor at the University of Tartu. Alexander discussed the use of Casual Affective Triggers (CAT) to incentivize people to accept survey invitations and improve the completion rate. He revealed the impact of CATs on survey response rates from a study he conducted.
Mon, 30 Jan 2023 - 35min
469 - Conversational Surveys
Traditional surveys have straight-jacket questions to be answered, thus restricting the information that can be gotten. Today, Ziang Xiao, a Postdoc Researcher in the FATE group at Microsoft Research Montréal, talks about conversational surveys, a type of survey that asks questions based on preceding answers. He discussed the benefits of conversational surveys and some of the challenges it poses.
Mon, 23 Jan 2023 - 39min
468 - Do Results Generalize for Privacy and Security Surveys
Today, Jenny Tang, a Ph.D. student of societal computing at Carnegie Mellon University discusses her work on the generalization of privacy and security surveys on platforms such as Amazon MTurk and Prolific. Jenny shared the drawbacks of using such online platforms, the discrepancies observed about the samples drawn, and key insights from her results.
Tue, 17 Jan 2023 - 40min
467 - 4 out of 5 Data Scientists Agree
This episode kicks off the new season of the show, Data Skeptic: Surveys. Linhda rejoins the show for a conversation with Kyle about her experience taking surveys and what questions she has for the season. Lastly, Kyle announces the launch of survey.dataskeptic.com, a new site we're launching to gather your opinions. Please take a moment and share your thoughts!
Tue, 10 Jan 2023 - 28min
466 - Crowdfunded Board Games
It may be intuitive to think crowdfunding a project drives its innovation and novelty, but there are no empirical studies that prove this. On the show, Johannes Wachs shares his research that sought to determine whether crowdfunding truly drives innovation. He used board games as a case study and shared the results he found.
Mon, 26 Dec 2022 - 34min
465 - Russian Election Interference Effectiveness
There were reports of Russia’s interference in the 2016 US elections. In today’s episode, Koustuv Saha, a researcher at Microsoft Research walks us through the effect of targeted ads for political campaigns. Using practical examples, he discusses how targeted ads can propagate fake news, its ripple effects on electioneering, and how to find a sweet spot with targeted ads.
Mon, 19 Dec 2022 - 41min
464 - Placement Laundering Fraud
There is an unsung kind of ad fraud brewing in the ad tech space — placement laundering fraud. On the show, Jeff Kline discusses what placement laundering fraud is, how it can be identified, and possible solutions to it. Listen to learn more.
Thu, 15 Dec 2022 - 32min
463 - Data Clean Rooms
Bosko Milekic, the Co-founder of Optable, a data collaboration platform for the media and advertising industry, joins us today. Bosko talked about the clean rooms, the technology driving data privacy during collaboration. He discussed why clean rooms are gaining widespread adoption, and how users can exploit Optable’s clean room platform for a secured data-sharing experience.
Mon, 12 Dec 2022 - 31min
462 - Dark Patterns in Site Design
Kerstin Bongard-Blanchy is a Research Associate at the University of Luxembourg. She joins us to discuss her study that investigated dark patterns in web designs. She discussed the results, the effect of dark patterns effect on users, whether an average user can detect them, and the way forward to a more ethical web space.
Mon, 05 Dec 2022 - 34min
461 - Internet Advertising Bureau Media Lab
We are joined by Anthony Katsur, the CEO of IAB Tech Lab. Anthony discusses standards within the ad tech industry. He explained how IAB Tech Lab set and propagates global standards, actions to ensure compliance from advertisers, and industry trends for a more privacy-centric ad tech space.
Sat, 03 Dec 2022 - 37min
460 - Your Mouse Reveals Your Gender and Age
When we navigate a webpage, it is fairly easy for our mouse movement to be tracked and collected. Today, Luis Leiva, a Professor of Computer Science discusses how these mouse tracking data can be used to predict age, gender and user attention. He also discusses the privacy concerns with mouse tracking data and possible ways it can be curtailed.
Mon, 28 Nov 2022 - 39min
459 - Measuring Web Search Behavior
On the show, Aleksandra Urman and Mykola Makhortykh join us to discuss their work on the comparative analysis of web search behavior using web tracking data. They shared interesting results from their analysis, bordering around the user preferences for search engines, demographic patterns, and differences between how men and women surf the net.
Mon, 21 Nov 2022 - 36min
458 - StrategyQA and Big Bench
Did Aristotle Use a Laptop? That's a question from the StrategyQA benchmark which highlights the stretch goals for current artificial intelligence systems. Answering a question like that requires several cognitive steps and reasoning. Constructing a dataset of similarly challenging questions is a major undertaking. On today's episode, Mor Geva returns to share details about the creation of StrategyQA and the larger Big Bench dataset it has been included in.
Fri, 18 Nov 2022 - 41min
457 - Ad Blockers Effect on News Consumption
While at first glance, the use of ad blockers drops the revenue of news publishers, this may not be completely true. On the show today, Shunyao Yan, an Assistant Professor in Marketing at Leavey School of Business, Santa Clara University, discussed the effect of ad blockers on news consumption and how ad blockers can potentially be helpful for news publishers.
Mon, 14 Nov 2022 - 38min
456 - Your Consent is Worth 75 Euros a Year
People who do not want their data tracked and shared online can pay a token for a cookie paywall. But are the websites keeping to their side of the bargain? Victor Morel, a Postdoc candidate at the Chalmers University of Technology joins us to discuss his work around auditing the activities of cookie paywalls. He discussed the findings from his analysis and proffers some solutions to making cookie paywalls more transparent.
Mon, 07 Nov 2022 - 24min
455 - Automated Email Generation for Targeted Attacks
The advancement of generative language models has been a force for good, but also for evil. On the show, Avisha Das, a post-doctoral scholar at the University of Texas Health Center, joins us to discuss how attackers use machine learning to create unsuspecting phishing emails. She also discussed how she used RNN for automated email generation, with the goal of defeating statistical detectors.
Mon, 31 Oct 2022 - 45min
454 - Tribal Marketing
Peter Gloor, a Research Scientist at the MIT Center for Collective Intelligence, takes us on a new world of tribe classification. He extensively discussed the need for such classification on the internet and how he built a machine learning model that does it. Listen to find out more!
Mon, 24 Oct 2022 - 37min

Weitere Folgen anzeigen

Nach Genre filtern

Data Skeptic

Podcasts ähnlich wie Data Skeptic