Glossary

0-9

1-shot learning 5G + AI 6DoF pose estimation 7D representation 8-bit quantization 2-stage detector 4D data 0-shot learning 9-layer network 3D convolution

A

AGI / Artificial General Intelligence Autoencoder Attention Algorithm Artificial Intelligence (AI)

B

Backpropagation BERT Boosting Batch Normalization Bias

C

Chatbot Clustering CNN / Convolutional Neural Network Cross-Validation Classifier / Classification

D

Deep Learning Deepfake Discriminative Model Deterministic Model Data Augmentation

E

Embedding Encoder Epoch Ensemble Learning Explainable AI (XAI)

F

Fine-tuning Fusion / Multimodal Fusion Forward Propagation Foundation Model Feature Extraction

G

GAN / Generative Adversarial Network Gradient Descent Grounding Graph Neural Network (GNN) Generative AI

H

Hyperparameter Heuristic Hidden Layer Hierarchical Model Hallucination

I

Imbalanced Data Interpretability Instruction tuning Instance / Sample Intelligence Amplification / Augmentation

J

JAX Jittering Joint Embedding JSONL / JSON-lines Juxtaposition

K

KL Divergence (Kullback–Leibler Divergence) K-means Clustering K-Shot Learning Kernel Trick Knowledge Distillation

L

Latent Variable Loss Function LSTM / Long Short-Term Memory Large Language Model (LLM) Learning Rate

M

Multimodal / Multimodality Machine Learning (ML) Meta-learning Model Multi-head Attention

N

Normalization Neural Network NLP / Natural Language Processing NLU / Natural Language Understanding Novelty Detection / Anomaly Detection

O

Objective Function Online Learning One-hot Encoding Overfitting Optimizer

P

Policy / Reinforcement Learning Policy Pooling Pretraining Prompt Parameter

Q

Queue / Buffer Quantization Q-learning Query Quality Estimation

R

Retrieval Augmented Generation (RAG) Representation Learning Reinforcement Learning (RL) Regularization RNN / Recurrent Neural Network

S

Supervised Learning Self-Supervised Learning Sequence Modeling Sampling Softmax

T

Training Data Tokenizer Transfer Learning Transformer Tuning / Hyperparameter Tuning

U

Universal Approximation Theorem Unsupervised Learning U-Net Underfitting Uncertainty Estimation

V

Variational Autoencoder (VAE) Vector Embedding Vanishing / Exploding Gradient Validation Set Vision Transformer (ViT)

W

Weak Supervision Weight Decay Whitening / Whitening Transformation Word Embedding Workflow

X

XOR problem X-axis / feature axis XAI / Explainable AI XLM XLNet

Y

Y-axis / feature axis Y-transform / YUV YAGNI (You Aren't Gonna Need It) Yield (model yield / throughput) Yoga of AI

Z

Z-score Normalization Zero-gradient phenomenon Zero-shot Learning / Zero-shot inference Zero-centric / Zero-bias initialization Zygosity in augmentation

What Is Word Embedding?

Word embedding is a technique for converting words into vectors for natural language processing (NLP) tasks. By mapping words into a continuous vector space, where words with related meanings end up close together, word embedding lets computers capture and process semantic relationships in language.
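To make the idea of "mapping words into a vector space" concrete, here is a minimal sketch of an embedding lookup followed by a similarity comparison. The vocabulary, the 2-dimensional vectors, and the helper names `embed` and `cosine_similarity` are all made up for illustration; real embeddings are learned from data and typically have tens to hundreds of dimensions.

```python
import math

# Toy vocabulary and embedding table. The 2-D values are invented for
# illustration only; real embeddings are learned from a text corpus.
vocab = {"cat": 0, "dog": 1, "car": 2}
embedding_table = [
    [0.9, 0.1],  # cat
    [0.8, 0.2],  # dog
    [0.1, 0.9],  # car
]

def embed(word):
    """Look up the vector for a word in the toy table."""
    return embedding_table[vocab[word]]

def cosine_similarity(u, v):
    """Cosine of the angle between two vectors (1.0 means same direction)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

sim_cat_dog = cosine_similarity(embed("cat"), embed("dog"))
sim_cat_car = cosine_similarity(embed("cat"), embed("car"))
print(f"cat vs dog: {sim_cat_dog:.3f}")
print(f"cat vs car: {sim_cat_car:.3f}")
```

With these toy values, "cat" scores higher against "dog" than against "car", which is the kind of semantic closeness an embedding space is meant to encode.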

At the core of word embedding are algorithms such as Word2Vec, GloVe, and FastText. These algorithms analyze large volumes of text to learn how words are used in different contexts, and then turn each word into a vector representation. A typical example is that the vectors for 'king' and 'queen' reflect a relationship similar to the one between 'man' and 'woman'.
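The 'king' / 'queen' example is usually expressed as vector arithmetic: vec('king') - vec('man') + vec('woman') should land near vec('queen'). The sketch below illustrates this with hand-made 3-dimensional vectors. The numbers and the `analogy` helper are assumptions chosen so the arithmetic is easy to follow; they are not the output of Word2Vec, GloVe, or FastText.

```python
import math

# Hand-made vectors; the axes loosely stand for (royalty, gender, other).
# These are illustrative values, not learned embeddings.
vectors = {
    "king":  [0.9,  0.8, 0.0],
    "queen": [0.9, -0.8, 0.0],
    "man":   [0.1,  0.9, 0.0],
    "woman": [0.1, -0.9, 0.0],
    "apple": [0.0,  0.0, 1.0],
}

def cosine_similarity(u, v):
    """Cosine of the angle between two vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def analogy(a, b, c, table):
    """Return the word closest to vec(a) - vec(b) + vec(c).

    The three query words are excluded from the candidates, as is
    common practice when evaluating analogies.
    """
    target = [x - y + z for x, y, z in zip(table[a], table[b], table[c])]
    candidates = [w for w in table if w not in {a, b, c}]
    return max(candidates, key=lambda w: cosine_similarity(table[w], target))

result = analogy("king", "man", "woman", vectors)
print(result)
```

With these toy vectors the nearest remaining word is 'queen'. On real learned embeddings the analogy holds only approximately and depends on the training corpus.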

The benefits of word embedding include the ability to handle large volumes of text, better semantic understanding, and applicability to many different machine learning models. It also has drawbacks, such as poor handling of rare words, which appear too seldom in the training text to get reliable vectors, and the risk of reproducing biases present in that text. Word embeddings should therefore be used with care to limit these problems.
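One known way to soften the rare-word problem is the subword approach used by FastText, which represents a word through its character n-grams so that an unseen or rare word can share pieces with familiar ones. The sketch below shows only the n-gram extraction and a simple overlap measure. The function names `char_ngrams` and `ngram_overlap` are hypothetical, and the sketch leaves out parts of the real method, such as learning a vector per n-gram and hashing n-grams into buckets.

```python
def char_ngrams(word, n_min=3, n_max=6):
    """Character n-grams of a word wrapped in boundary markers '<' and '>'."""
    wrapped = f"<{word}>"
    grams = []
    for n in range(n_min, n_max + 1):
        for i in range(len(wrapped) - n + 1):
            grams.append(wrapped[i:i + n])
    return grams

def ngram_overlap(word_a, word_b):
    """Jaccard overlap between the n-gram sets of two words (0.0 to 1.0)."""
    a = set(char_ngrams(word_a))
    b = set(char_ngrams(word_b))
    union = a | b
    if not union:
        return 0.0  # both words too short to produce any n-gram
    return len(a & b) / len(union)

# 3-grams of "where", including the boundary markers.
print(char_ngrams("where", 3, 3))

# A rare inflected form shares most of its n-grams with the common form,
# while an unrelated word shares none.
print(ngram_overlap("embedding", "embeddings"))
print(ngram_overlap("embedding", "banana"))
```

Because a rare form such as "embeddings" overlaps heavily with "embedding", a subword model can build a reasonable vector for it from pieces it has already seen, whereas a purely word-level table would have little or nothing to go on.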

Looking ahead, as deep learning advances, word embedding can be combined with more complex models such as Transformers, improving both the accuracy and the flexibility of language understanding.