Deep Learning
Basics
TBW
Data Preparation
Image
Information - How to Resize Input Images (e.g., AlexNet, ResNet)
Information - Automatic Image Augmentation
Information - Tensor Layouts in Memory: NCHW vs NHWC
https://docs.nvidia.com/deeplearning/performance/dl-performance-convolutional/index.html
https://gist.github.com/mingfeima/595f63e5dd2ac6f87fdb47df4ffe4772
https://www.intel.com/content/www/us/en/docs/onednn/developer-guide-reference/2023-1/understanding-memory-formats.html
- N: Batch
- C: Channels
- H: Height
- W: Width
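The resize and layout topics above can be sketched in NumPy. This is an illustrative sketch, not any framework's API: it center-crops a 256x256 image to the 224x224 input size commonly used by AlexNet/ResNet, then converts a batch from NHWC to NCHW by moving the channel axis. All array names and sizes are assumptions for the example.

```python
import numpy as np

# One HWC image; 256x256 is an assumed starting size for this sketch.
img = np.zeros((256, 256, 3), dtype=np.float32)

# Center-crop to the 224x224 input size used by AlexNet/ResNet.
top = (img.shape[0] - 224) // 2
left = (img.shape[1] - 224) // 2
crop = img[top:top + 224, left:left + 224, :]

# Stack into a batch (NHWC), then convert to NCHW (PyTorch's default
# layout) by moving the channel axis to position 1.
nhwc = np.stack([crop])              # (1, 224, 224, 3)
nchw = nhwc.transpose(0, 3, 1, 2)    # (1, 3, 224, 224)
print(nchw.shape)  # (1, 3, 224, 224)
```

Note that `transpose` only changes the logical index order; frameworks additionally distinguish how the data is physically laid out in memory, which is what the NVIDIA and oneDNN links above discuss.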
Network
Design
- Input
- Hidden
- Output
Architecture
- MLP: Multi-Layer Perceptron
- CNN: Convolutional Neural Network
https://cs231n.github.io/convolutional-networks/
- RNN: Recurrent Neural Network
- Transformer
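As a concrete reference for the simplest architecture above, here is a minimal MLP forward pass in NumPy (one hidden layer with ReLU; all layer sizes and the weight scale are illustrative assumptions):

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative sizes: input dim 4 -> hidden dim 8 -> output dim 3.
W1 = 0.1 * rng.normal(size=(4, 8))
b1 = np.zeros(8)
W2 = 0.1 * rng.normal(size=(8, 3))
b2 = np.zeros(3)

def mlp(x):
    h = np.maximum(x @ W1 + b1, 0.0)   # hidden layer with ReLU activation
    return h @ W2 + b2                 # output logits

x = rng.normal(size=(2, 4))            # batch of 2 input vectors
print(mlp(x).shape)  # (2, 3)
```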
Layer
- Linear
- Convolution
- Dropout
- Pooling
- Max Pooling
- Adaptive Max Pooling
- Normalization
- Batch Normalization
https://cvml-expertguide.net/terms/dl/layers/batch-normalization-layer/
- Recurrent
- RNN
- LSTM
- Transformer
- Encoder
- Decoder
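Of the layers listed above, Batch Normalization is easy to misread from the formula alone, so here is a minimal NumPy sketch (batch statistics only, as in training mode; `gamma`/`beta` are the learnable scale and shift, and all values are illustrative):

```python
import numpy as np

def batch_norm(x, gamma=1.0, beta=0.0, eps=1e-5):
    # Normalize each feature over the batch dimension, then scale/shift.
    mean = x.mean(axis=0)
    var = x.var(axis=0)
    x_hat = (x - mean) / np.sqrt(var + eps)
    return gamma * x_hat + beta

rng = np.random.default_rng(0)
x = rng.normal(loc=5.0, scale=2.0, size=(32, 4))  # batch of 32, 4 features
y = batch_norm(x)
print(np.allclose(y.mean(axis=0), 0.0, atol=1e-6))  # True
```

A real layer also tracks running statistics for inference; this sketch omits that to keep the normalization step itself visible.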
Node
- Number
Activation
- ReLU [Common Choice]
- Tanh
- Sigmoid
- Softmax: Typically used in the output layer for multi-class classification problems
- GELU: Suitable for transformer networks [Smart Choice]
- Leaky ReLU
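The activations above written out in NumPy (the GELU here uses the common tanh approximation found in many Transformer implementations, which is an approximation rather than the exact definition):

```python
import numpy as np

def relu(x):
    return np.maximum(x, 0.0)

def leaky_relu(x, alpha=0.01):          # small slope alpha for x < 0
    return np.where(x > 0, x, alpha * x)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def gelu(x):                            # tanh approximation of GELU
    return 0.5 * x * (1.0 + np.tanh(np.sqrt(2.0 / np.pi) * (x + 0.044715 * x**3)))

def softmax(x):
    z = np.exp(x - np.max(x))           # subtract the max for stability
    return z / z.sum()

p = softmax(np.array([1.0, 2.0, 3.0]))
print(round(float(p.sum()), 6))  # 1.0 (outputs form a probability distribution)
```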
Loss Function
https://qiita.com/Hatomugi/items/d00c1a7df07e0e3925a8
- Regression
- MSE: More sensitive to outliers in the data [Common Choice]
- MAE: Less sensitive to outliers in the data
- Huber Loss: Switches between MSE-like (small errors) and MAE-like (large errors) behavior at a threshold [Smart Choice]
- Classification
- Cross-Entropy: Prime candidate for classification
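The outlier-sensitivity difference between the regression losses above can be seen directly in a small NumPy example (the data and the Huber threshold `delta=1.0` are illustrative):

```python
import numpy as np

def mse(y, t):
    return np.mean((y - t) ** 2)

def mae(y, t):
    return np.mean(np.abs(y - t))

def huber(y, t, delta=1.0):
    r = np.abs(y - t)
    quad = 0.5 * r**2                    # MSE-like branch for small errors
    lin = delta * (r - 0.5 * delta)      # MAE-like branch for large errors
    return np.mean(np.where(r <= delta, quad, lin))

t = np.array([0.0, 0.0, 0.0, 0.0])
y = np.array([0.1, -0.1, 0.2, 10.0])     # the last prediction is an outlier
print(mse(y, t))    # largest: the squared outlier dominates
print(mae(y, t))    # 2.6
print(huber(y, t))  # in between: linear penalty on the outlier
```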
Optimizer
https://qiita.com/omiita/items/1735c1d048fe5f611f80
- SGD: Stochastic Gradient Descent
- Momentum: SGD + Moving Average
- RMSProp: Root Mean Square Propagation (proposed in 2012)
- Adam: Momentum + RMSProp (proposed in 2014) [Common Choice]
- RAdam: Rectified Adam (proposed in 2020) [Smart Choice]
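The "Momentum + RMSProp" composition of Adam becomes clear when the update rules are written out. This is a toy sketch on the quadratic loss L(w) = 0.5 * w², whose gradient is simply w; all hyperparameter values are illustrative defaults:

```python
import numpy as np

def sgd_momentum_step(w, v, grad, lr=0.1, mu=0.9):
    v = mu * v - lr * grad            # moving average of scaled gradients
    return w + v, v

def adam_step(w, m, s, grad, t, lr=0.1, b1=0.9, b2=0.999, eps=1e-8):
    m = b1 * m + (1 - b1) * grad      # first moment (the Momentum part)
    s = b2 * s + (1 - b2) * grad**2   # second moment (the RMSProp part)
    m_hat = m / (1 - b1**t)           # bias correction for early steps
    s_hat = s / (1 - b2**t)
    return w - lr * m_hat / (np.sqrt(s_hat) + eps), m, s

w, v = 1.0, 0.0
for _ in range(200):
    w, v = sgd_momentum_step(w, v, grad=w)
print(abs(w) < 0.01)  # True: converges toward the minimum at w = 0
```

RAdam additionally rectifies the adaptive learning rate during the first steps, when the second-moment estimate is still high-variance; that correction is omitted here.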
Hyperparameter
- Learning Rate: Initial / Final / Fixed or Time-Based Decay
- Batch Size: Larger batches speed up training and help avoid getting trapped in local minima, but may reduce accuracy
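A time-based decay schedule for the learning rate can be written as lr_t = lr_initial / (1 + k * t); the initial rate and decay constant below are illustrative values:

```python
# Time-based learning-rate decay: lr_t = lr_initial / (1 + k * t).
# lr_initial and k are illustrative values, t counts epochs (or steps).
lr_initial, k = 0.1, 0.01

def lr_at(t):
    return lr_initial / (1.0 + k * t)

print(lr_at(0))    # 0.1 (initial learning rate)
print(lr_at(100))  # 0.05 (halved after 100 epochs)
```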
Reinforcement Learning
- Gamma: Discount factor for future rewards
- Epsilon: Exploration rate (epsilon-greedy action selection)
- Tau: Soft-update rate for target networks
- Entropy: Entropy-bonus coefficient encouraging exploration
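A sketch of how gamma, epsilon, and tau are typically used in value-based agents such as DQN; all names and numeric values here are illustrative assumptions, not from a specific library:

```python
import random

gamma = 0.99       # discount factor for future rewards
epsilon = 1.0      # exploration rate for epsilon-greedy action selection
epsilon_decay = 0.995   # multiplied into epsilon after each episode
tau = 0.005        # soft-update rate for the target network

def choose_action(q_values, epsilon):
    # Explore with probability epsilon, otherwise pick the greedy action.
    if random.random() < epsilon:
        return random.randrange(len(q_values))
    return max(range(len(q_values)), key=lambda a: q_values[a])

def soft_update(target, online, tau):
    # Polyak averaging: target <- tau * online + (1 - tau) * target
    return [tau * o + (1 - tau) * t for t, o in zip(target, online)]

# Discounted return of a reward sequence: G = sum_t gamma**t * r_t
rewards = [1.0, 1.0, 1.0]
G = sum(gamma**t * r for t, r in enumerate(rewards))
print(round(G, 4))  # 2.9701
```

The entropy coefficient (used by PPO-style policy-gradient methods) adds a bonus proportional to the policy's entropy to the objective, encouraging exploration; it is omitted from this value-based sketch.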
Compiler
References
https://nonbiri-tereka.hatenablog.com/entry/2016/03/10/073633
https://qiita.com/omiita/items/d24568a835da6911b01e
https://acro-engineer.hatenablog.com/entry/2019/12/25/130000
https://medium.com/aureliantactics/ppo-hyperparameters-and-ranges-6fc2d29bccbe
https://stats.stackexchange.com/questions/153531/what-is-batch-size-in-neural-network
Acknowledgments
Daiphys is a professional-service company for research and development of leading-edge technologies in science and engineering.
Get started accelerating your business through our deep expertise in R&D with AI, quantum computing, and space development; please get in touch with Daiphys today!
Daiphys Technologies LLC - https://www.daiphys.com/