To be written.
https://docs.nvidia.com/deeplearning/performance/dl-performance-convolutional/index.html
https://gist.github.com/mingfeima/595f63e5dd2ac6f87fdb47df4ffe4772
https://www.intel.com/content/www/us/en/docs/onednn/developer-guide-reference/2023-1/understanding-memory-formats.html
https://cs231n.github.io/convolutional-networks/
https://cvml-expertguide.net/terms/dl/layers/batch-normalization-layer/
https://qiita.com/Hatomugi/items/d00c1a7df07e0e3925a8
https://qiita.com/omiita/items/1735c1d048fe5f611f80
https://nonbiri-tereka.hatenablog.com/entry/2016/03/10/073633
https://qiita.com/omiita/items/d24568a835da6911b01e
https://acro-engineer.hatenablog.com/entry/2019/12/25/130000
https://medium.com/aureliantactics/ppo-hyperparameters-and-ranges-6fc2d29bccbe
https://stats.stackexchange.com/questions/153531/what-is-batch-size-in-neural-network