Glossary
Datasets
AblationActive Learning (Machine Learning)Adversarial Machine LearningAffective AIAI AgentsAI and EducationAI and FinanceAI and MedicineAI AssistantsAI EthicsAI Generated MusicAI HallucinationsAI HardwareAI in Customer ServiceAI Recommendation AlgorithmsAI Video GenerationAI Voice TransferApproximate Dynamic ProgrammingArtificial Super IntelligenceBackpropagationBayesian Machine LearningBinary Classification AIChatbotsConversational AIConvolutional Neural NetworksCounterfactual Explanations in AICurse of DimensionalityData LabelingDeep LearningDeep Reinforcement LearningDifferential PrivacyDimensionality ReductionEmbedding LayerEmergent BehaviorExplainable AIF1 Score in Machine LearningF2 ScoreFeedforward Neural NetworkFine Tuning in Deep LearningGated Recurrent UnitGenerative AIGraph Neural NetworksHidden LayerHyperparameter TuningIntelligent Document ProcessingLarge Language Model (LLM)Loss FunctionMachine LearningMachine Learning in Algorithmic TradingModel DriftMultimodal LearningNatural Language Generation (NLG)Natural Language Processing (NLP)Natural Language Querying (NLQ)Natural Language Understanding (NLU)Neural Text-to-Speech (NTTS)Objective FunctionPrecision and RecallPretrainingRecurrent Neural NetworksTransformersUnsupervised LearningVoice CloningZero-shot Classification Models
AI Voice AgentsAlphaGoAlphaGo ZeroAutoregressive ModelBERTChatGPTChess botsContinuous Learning SystemsDall-EDiffusion ModelsDistilBERTEco-friendly AIFoundation ModelsGenerative Teaching NetworksGoogle's BardHidden Markov Models (HMMs)Human-centered AIInference EngineLarge Language Model (LLM)Llama 2Limited Memory AILLM CollectionMambaMetacognitive Learning Models Midjourney (Image Generation)MistralMultimodal AI Models and ModalitiesNeuralinkOpenAI SoraOpenAI WhisperPerceptronProbabilistic Models in Machine LearningRoBERTaRule-Based AISelf-healing AISpeech-to-text modelsText-to-Speech ModelsWhisper v2Whisper v3XLNet
Acoustic ModelsActivation FunctionsAdaGradAI AlignmentAI Emotion RecognitionAI GuardrailsAI Speech EnhancementArticulatory SynthesisAttention MechanismsAutoregressive ModelBatch Gradient DescentBeam Search AlgorithmBenchmarkingCandidate SamplingCapsule Neural NetworkCausal InferenceClassificationClustering AlgorithmsCognitive ComputingCognitive MapComputational CreativityComputational PhenotypingConditional Variational AutoencodersConcatenative SynthesisContext-Aware ComputingContrastive LearningCURE AlgorithmData AugmentationDeepfake DetectionDiffusionDomain AdaptationDouble DescentEnd-to-end LearningEvolutionary AlgorithmsExpectation MaximizationFeature Store for Machine LearningFlajolet-Martin AlgorithmForward PropagationGaussian ProcessesGenerative Adversarial Networks (GANs)Gradient Boosting Machines (GBMs)Gradient ClippingGradient ScalingGrapheme-to-Phoneme Conversion (G2P)GroundingHyperparametersHomograph DisambiguationHooke-Jeeves AlgorithmInstruction TuningKeyphrase ExtractionKnowledge DistillationKnowledge Representation and Reasoningk-ShinglesLatent Dirichlet Allocation (LDA)Markov Decision ProcessMetaheuristic AlgorithmsMixture of ExpertsModel InterpretabilityMultimodal AINeural Radiance FieldsNeural Text-to-Speech (NTTS)One-Shot LearningOnline Gradient DescentOut-of-Distribution DetectionOverfitting and UnderfittingParametric Neural Networks Prompt ChainingPrompt EngineeringPrompt TuningQuantum Machine Learning AlgorithmsRegularizationRepresentation LearningRetrieval-Augmented Generation (RAG)RLHFSemantic Search AlgorithmsSemi-structured dataSentiment AnalysisSequence ModelingSemantic KernelSemantic NetworksStatistical Relational LearningSymbolic AITokenizationTransfer LearningVoice CloningWinnow AlgorithmWord Embeddings
Last updated on May 7, 20241 min read
Datasets
Dive into the 'Datasets' section and unravel the intricate processes powering AI's magic. From the questions of SQuAD to the endless libraries of The Pile, journey through the stages that bring AI to life.
Artificial intelligence requires massive amounts of data to yield practical results. This page lists a few of the many massive datasets that various models use to learn anything from natural language to image generation