CS224N / Study Notes

Concept Index

Core concepts · Quick term lookup · Browse by category · 494 concepts in total

Efficient Inference and Deployment

53 concepts

Acceptance Rate

adaptive computation

AnchorAttention

collaborative inference

Cumulative Acceptance Rate

Element-wise LUT

Fast Graph Decoder

Multi-Token Prediction

Speculative Decoding

split computing

structural reparameterization

Top-K Selection

Vocabulary Trimming

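Foundational Theory

60 concepts

Ergodic Dynamical System

Normalized Second Moment

Computational Complexity

Sampling Without Replacement

Error-Performance Trade-off

Compression Order Theorem

Integer Linear Programming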

Chain-of-Thought

Cholesky Decomposition

Excess Kurtosis

Extended Golay Code

Gram-Schmidt Orthogonalization

In-Context Learning

Lebesgue Measure

Long-Tailed Distribution

Maximal Marginal Relevance

Mechanistic Interpretability

Monte Carlo Estimation

Multi-Probe LSH

multi-probe retrieval

neural scaling law

Rate-Distortion

Shannon Rate-Distortion

Sparse Autoencoder

Spherical K-Means

Test-Time Compute

Tree-of-Thought

Pruning and Sparsification

52 concepts

Pruning Compression Ratio

Structured Pruning

Learnable Gating

Block Influence

Magnitude Pruning

Visual Token Pruning

Quantization and Low-Rank

52 concepts

Hadamard rotation

Shape-Gain Quantization

Model Growth

6 concepts

Catastrophic Forgetting

function-preserving

Loss of Plasticity

progressive training

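Deep Learning Fundamentals

57 concepts

Focal Frequency Loss

Morphological Operations

Cosine Similarity

Autoregressive Decoding

Adaptive Thresholding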

Activation Function

Backpropagation

Batch Normalization

Binary Cross-Entropy

Computation Graph

Cross-Attention

Cross-Entropy Loss

Cross-Modal Attention

Gradient Clipping

Gradient Descent

Greedy Decoding

Hadamard Transform

Hadamard Product

hierarchical softmax

Key-Value Projection

Linear Attention

Multi-Head Attention

Multi-Head Self-Attention

Multi-Task Loss

Positional Encoding

Sobel Edge Detection

Softmax Attention

sparse attention

Temperature Sampling

Vanishing Gradient

Vision Tasks

7 concepts

Class-Incremental Learning

Diffusion Models

Vision-Language Models

Datasets and Evaluation

35 concepts

Network Architecture

88 concepts

Recurrent Neural Network

classification head

Decision Transformer

Encoder-Decoder

Highway Network

Implicit Neural Representation

LLaVA-OneVision

Training Optimization

39 concepts

Knowledge Distillation

15 concepts

self-distillation

NAS and Automated Design

4 concepts

Neural Architecture Search

NLP Fundamentals

19 concepts

Co-occurrence Matrix

Dependency Parsing

Distributional Semantics

Machine Translation

Negative Sampling

Sentiment Analysis

Transition-Based Parsing

Uncategorized

7 concepts

Value Alignment