We're sorry but this page doesn't work properly without JavaScript enabled. Please enable it to continue.
Feedback

Inventing Curriculum using Python and spaCy

00:00

Formal Metadata

Title
Inventing Curriculum using Python and spaCy
Title of Series
Number of Parts
637
Author
Contributors
License
CC Attribution 2.0 Belgium:
You are free to use, adapt and copy, distribute and transmit the work or content in adapted or unchanged form for any legal purpose as long as the work is attributed to the author in the manner specified by the author or licensor.
Identifiers
Publisher
Release Date
Language

Content Metadata

Subject Area
Genre
Abstract
Are you an educator who wants to design teach an industry-aligned curriculum? Then you have come to the right place. In this talk, we will show how to design a better curriculum using natural language processing libraries in python, i.e., spaCy and Textacy. The curriculum in the general and undergraduate curriculum, in particular, is one of the most important pillars of an education system. The undergraduate curriculum has two main objectives i.e. employability and higher education. The greatest challenge in designing an undergraduate curriculum is achieving a balance between employability skills and laying the foundation for higher education. Generally, the curriculum is a combination of core technical subjects, professional electives, humanities, and skill-oriented subjects. We used natural language processing and machine learning packages in Python to build a curriculum design system. The steps to build a curriculum design system are described below: 1. The dataset was built from the job profiles from different job listing websites like stackoverflow.com, indeed.com, linkedin.com, and monster.com. Also from the syllabus of competitive exams and qualifying exams for higher education. 2. On the dataset, we applied natural language processing techniques to identify the subjects and subject content. For natural language processing, we used spaCy an industrial-strength Natural Language Processing package in Python. 3. To generate syllabus content for a particular subject, a pointer-generator network was used. The pointer generator network is a text summarization technique that combines extractive and abstractive summarization techniques. The extractive summarization technique extracts keywords from the dataset, whereas the abstractive summarization technique generates new text from the existing text. The pointer-generator network was implemented using the scikit-learn machine learning package in Python. 4. The generated curriculum was then compared with the existing curriculum to get insights like, how much percent of the curriculum is industry oriented, how much percent of the curriculum is aimed at higher education and job-oriented skills. At this step, we used the ROGUE (Recall-Oriented Understudy Gisting Evaluation) metric to compare the generated curriculum against the reference/proposed curriculum 5. The above steps can be repeated with modified parameters to get better insights and curriculum. This also gives us an idea of how we can have an evolving curriculum that can help us bridge the gap between industry and academia.
Computer programGoogolPoint cloudMathematicsProjective planeStudent's t-testComputer programGoogolComputer animationXMLUML
Content (media)Observational studyLatent heatInformationStandard deviationSoftware frameworkOpen sourceData structureBounded variationCore dumpOpen setData managementCodecComputer programInductive reasoningIntegrated development environmentTotal S.A.NumberArchitectureServer (computing)AerodynamicsWebsiteMultitier architectureClient (computing)System programmingSubject indexingStructural loadBlock (periodic table)ScalabilityCache (computing)Execution unitCommunications protocolWeb browserAbelian categoryCodeVariable (mathematics)Function (mathematics)ProgrammschleifeCondition numberScripting languageObject (grammar)Form (programming)Component-based software engineeringDatabaseArray data structureTable (information)Query languageEvent horizonCross-site scriptingDocument Type DefinitionService (economics)Web pageQuicksortBuildingJava appletSoftware developerWeightSoftware testingIdeal (ethics)ParsingStack (abstract data type)Design by contractType theoryInteractive televisionFunctional (mathematics)Fundamental theorem of algebraAbstractionComputer networkPointer (computer programming)SubsetStandard deviationStudent's t-testTerm (mathematics)Sampling (statistics)Combinational logicComputer configurationObservational studyGreatest elementInformationSheaf (mathematics)SubsetWeb 2.0Different (Kate Ryan album)Multiplication signDiagramComputer programDescriptive statisticsType theoryPointer (computer programming)Execution unitResultantCore dumpStatistical hypothesis testingSoftware frameworkServer (computing)Web-DesignerData structureContent (media)Context awarenessProcess (computing)WebsiteOpen setBootstrap aggregatingStack (abstract data type)Selectivity (electronic)Universe (mathematics)Object (grammar)2 (number)WordProjective planeSoftware testingCASE <Informatik>Engineering physicsData storage deviceSoftwareWeb pageForcing (mathematics)Interior (topology)Water vaporGroup actionView (database)Natural languageService (economics)Scaling (geometry)Condition numberFrequencyPhysical systemState of matterLevel (video gaming)AdditionNumberWindowMereologyAbstractionBlogUsabilityQuicksortComputer animation
NumberFrequencyMathematical analysisQuicksortVolumeProcess (computing)Natural languageSoftwareComputing platformInformation extractionSystem programmingLibrary (computing)InformationTexture mappingString (computer science)Array data structureData modelParsingPattern recognitionStatisticsEndliche ModelltheorieState of matterDisintegrationVector graphicsTask (computing)Variety (linguistics)Fundamental theorem of algebraSystem identificationFunction (mathematics)Extension (kinesiology)UsabilityContent (media)MiniDiscFile formatStreaming mediaDatabase normalizationThermodynamischer ProzessTerm (mathematics)Element (mathematics)Set (mathematics)Similarity (geometry)Metric systemPairwise comparisonGradientEndliche ModelltheorieProjective planeVisualization (computer graphics)CodeWordNatural languageProcess (computing)Library (computing)Information extractionFreewareCombinational logicProduct (business)Variety (linguistics)WebsiteTask (computing)FrequencyPreprocessorNumberCartesian coordinate systemMathematical analysisToken ringOpen sourceBuildingVolume (thermodynamics)Thermodynamischer ProzessStreaming mediaPhysical systemFunctional (mathematics)MereologySelectivity (electronic)Set (mathematics)Speech synthesisSoftwareExtension (kinesiology)MiniDiscContext awarenessOrder (biology)Form (programming)Computing platformSystem identificationPattern recognitionPoint (geometry)Raw image formatDampingVector spaceSpacetimeAuthorizationQuantum stateCondition numberSelf-organizationMetropolitan area networkStress (mechanics)InformationLevel (video gaming)GoogolUniform resource locatorBarrelled spaceResonatorSoftware testingFamilyForcing (mathematics)Object (grammar)DataflowEstimationMathematicsPrice indexChemical equationInterpreter (computing)Inclusion mapWave packetComputer animation
Limit (category theory)OntologySelf-organizationUsabilityOpen setComputer programSystem callOrder (biology)Library (computing)Endliche ModelltheorieNumberToken ringLeakNatural languageDescriptive statisticsFamilyObservational studyFrequencyWordCASE <Informatik>Process (computing)Statement (computer science)ArmForm (programming)SpacetimeAdditionCollisionCodeElectronic mailing listTerm (mathematics)Set (mathematics)RootEntire functionTask (computing)Object (grammar)Single-precision floating-point formatPreprocessorLine (geometry)Projective planeUniform resource locatorEmailJSON
Range (statistics)Source codeRepository (publishing)Student's t-testDisk read-and-write headResultantOrder (biology)Process (computing)Graph coloringMultiplication signLevel (video gaming)NumberInformationObject (grammar)Electronic mailing listTerm (mathematics)Perspective (visual)Focus (optics)Sampling (statistics)
BootingVaporQuery languageElectronic data interchangeConvex hullFrequencyWeb 2.0NumberTerm (mathematics)Token ringWordMultiplication signDiagramFunction (mathematics)Grass (card game)Discrete element methodForcing (mathematics)Amenable groupSource code
Translation (relic)Multiplication signFunction (mathematics)Term (mathematics)CASE <Informatik>Source code
NumberPreprocessorElectronic mailing listQuery languageDivisorStaff (military)CoroutineForcing (mathematics)AdditionTouchscreenObject (grammar)Focus (optics)GoogolComputer clusterWordCombinational logicFlow separationResultantCASE <Informatik>
Natural languageThermodynamischer ProzessPointer (computer programming)Computer networkDivisorNatural languagePointer (computer programming)DivisorResultantAreaData miningForestHypermediaComputer animation
Transcript: English(auto-generated)