We're sorry but this page doesn't work properly without JavaScript enabled. Please enable it to continue.
Feedback

Lies, damned lies and large language models

00:00

Formal Metadata

Title
Lies, damned lies and large language models
Title of Series
Number of Parts
131
Author
Contributors
License
CC Attribution - NonCommercial - ShareAlike 3.0 Unported:
You are free to use, adapt and copy, distribute and transmit the work or content in adapted or unchanged form for any legal and non-commercial purpose as long as the work is attributed to the author in the manner specified by the author or licensor and the work or content is shared also in adapted form only under the conditions of this
Identifiers
Publisher
Release Date
Language

Content Metadata

Subject Area
Genre
Abstract
Would you like to use large language models (LLMs) in your own project, but are troubled by their tendency to frequently “hallucinate”, or produce incorrect information? Have you ever wondered if there was a way to easily measure an LLM’s hallucination rate, and compare this against other models? And would you like to learn how to help LLMs produce more accurate information? In this talk, we’ll have a look at some of the main reasons that hallucinations occur in LLMs, and then focus on how we can measure one specific type of hallucination: the tendency of models to regurgitate misinformation that they have learned from their training data. We’ll explore how we can easily measure this type of hallucination in LLMs using a dataset called TruthfulQA in conjunction with Python tooling including Hugging Face’s `datasets` and `transformers` packages, and the `langchain` package. We’ll end by looking at recent initiatives to reduce hallucinations in LLMs, using a technique called retrieval augmented generation (RAG). We’ll look at how and why RAG makes LLMs less likely to hallucinate, and how this can help make these models more reliable and usable in a range of contexts.
Formal languageEndliche ModelltheorieProcess (computing)Object-oriented analysis and designOrder (biology)Multiplication signFormal languageMoment (mathematics)Endliche ModelltheorieView (database)Computer animationLecture/Conference
SpacetimeOrdinary differential equationComplete metric spaceComputer iconEndliche ModelltheorieSqueeze theoremWebsitePlastikkarteFormal languageLinear regressionMathematical modelFormal languageComputer architectureWindowOpen sourceNatural languageMaschinelle ÜbersetzungData compressionContent (media)Wave packetRow (database)Type theoryVariety (linguistics)Multiplication signFunction (mathematics)Range (statistics)AirfoilSpacetimeTask (computing)Electric generatorParameter (computer programming)Context awarenessWordEndliche ModelltheorieSemiconductor memoryTransformation (genetics)Set (mathematics)MereologyDependent and independent variablesSoftwarePredictability2 (number)BitDifferent (Kate Ryan album)Formal grammarReal numberCartesian coordinate systemPoint (geometry)Software developerWeightOrder (biology)AreaInformationStructural loadPower (physics)ScalabilityFamilyWeb 2.0CASE <Informatik>Parametrische ErregungSequenceScaling (geometry)Speech synthesisRule of inferenceProcess (computing)Web crawlerLecture/ConferenceMeeting/InterviewComputer animation
Endliche ModelltheorieSign (mathematics)Web crawlerRule of inferenceFilter <Stochastik>QuicksortCombinational logicMeta elementLink (knot theory)Content (media)Wave packetOpen sourceNatural languageDigital photographyControl theoryInformation retrievalMathematical modelType theoryEndliche ModelltheorieWeb crawlerWeb pageProjective planeSet (mathematics)InternetworkingProcess (computing)Category of beingRange (statistics)Source codeDifferent (Kate Ryan album)Cartesian coordinate systemRevision controlMathematicsVideo gameAugmented realityBounded variationWordReal numberBlock (periodic table)Axiom of choiceElectronic mailing listTheoryWeb 2.0Power (physics)Core dumpTask (computing)Annihilator (ring theory)Order (biology)2 (number)Multiplication signBit rateFeedbackInclusion mapTunisMobile appOnline chatExtreme programmingMeasurementParametrische ErregungLine (geometry)Scaling (geometry)BitVector potentialSimulationBlogPiLagrange-MethodeBookmark (World Wide Web)AirfoilLinear regressionComputer animation
ChainTemplate (C++)Dependent and independent variablesMathematical modelView (database)Codierung <Programmierung>Mathematical modelOpen setEndliche ModelltheorieAxiom of choiceData dictionaryObject (grammar)Revision controlSet (mathematics)Electronic mailing listMultiplicationTurbo-CodeField (computer science)Different (Kate Ryan album)BitContext awarenessOrder (biology)Bit rateChainLoop (music)Multiplication signOpen sourceFunction (mathematics)MereologyCASE <Informatik>MeasurementComputer programmingPerspective (visual)Exception handlingBookmark (World Wide Web)Selectivity (electronic)Phase transitionSubsetComputer animation
AirfoilControl flowMultiplication signParameter (computer programming)Formal languageSet (mathematics)MeasurementTheoryEndliche ModelltheorieBit rateGodTask (computing)Hydraulic jumpNumberMathematical modelWave packetInformationShared memoryElectric generatorDependent and independent variablesLatent heatBitComplex (psychology)Order (biology)Sheaf (mathematics)Slide ruleVariety (linguistics)MultiplicationFunction (mathematics)Moment (mathematics)Context awarenessoutputSource codeNatural languageType theoryProcess modelingInformation retrievalDemo (music)Line (geometry)Domain-specific languageCollaborationismReduction of order2 (number)Computer animation
Abelian categoryCategory of beingFrame problemTurbo-CodeHand fanTotal S.A.View (database)WindowRootSingle-precision floating-point formatSlide ruleShared memorySource codeTouch typingProbability density functionMathematical modelVirtual machineBit rateEndliche ModelltheorieFormal languageQR codeComputer animation
Formal languageEndliche ModelltheorieBit rateSlide ruleGoodness of fitLecture/Conference
outputFunction (mathematics)Multiplication signSemiconductor memoryEndliche ModelltheorieConsistencyContext awarenessOnline chatMathematical modelSlide ruleWindowRight angleLecture/Conference
Wave packetMereologyBit rateDirection (geometry)Lecture/Conference
GoogolMeasurementSoftware testingKey (cryptography)Performance appraisalSet (mathematics)MereologyWave packetValuation (algebra)Arc (geometry)AbstractionRight angleLecture/Conference
Channel capacityEndliche ModelltheorieSet (mathematics)Goodness of fitCross-correlationFormal languageData qualityLecture/ConferenceComputer animation
Endliche ModelltheorieFreewareMathematical modelBenchmarkOpen setSmith chartArchitectureLattice (order)PressureSlide ruleNumberOpen setWhiteboardMathematical modelRight angleInformationAnnihilator (ring theory)Multiplication signLecture/ConferenceComputer animation
RootSystem on a chipMultiplication signComputer animationLecture/Conference
Transcript: English(auto-generated)