算法转载收藏

2024年6月9日 175次阅读

研究课题

☆☆☆超级好的一个分类List：https://handong1587.github.io/categories.html#deep_learning-ref
深度丨AI界的七大未解之谜：OpenAI丢出一组AI研究课题
 Olympiad Inequalities

optimization

数值优化（Numerical Optimization）学习系列-目录：蛮不错的数值优化书的读书笔记，推荐看。

Reinforcement Learning

RL List：https://handong1587.github.io/deep_learning/2015/10/09/reinforcement-learning.html

–

Unsupervised Learning

Unsuperiised Learning List：https://handong1587.github.io/deep_learning/2015/10/09/unsupervised-learning.html

–

MCTS && AlphaGo

Chernoff-Hoeffding Bound
A Survey of Monte Carlo Tree Search Methods, Cameron Browne et al., 2012
UCT: Bandit based Monte-Carlo Planning, Levente Kocsis et al., 2006
UCB1: Finite-time Analysis of the Multiarmed Bandit Problem, PETER AUER et al., 2002
Introduction to Monte Carlo Tree Search；
蒙特卡洛树搜索(Monte Calro Tree Search, MCTS)究竟是啥
知乎：蒙特卡洛树是什么算法？
高级强化学习系列第二讲探索-利用困境（exploration-exploitation dilemma）（三）（二）（一）
浅述：从 Minimax 到 AlphaZero，完全信息博弈之路（1）
UCT算法百度百科
深入浅出看懂AlphaGo元：：，：：深入浅出看懂AlphaGo如何下棋

进化算法，EA

TBD

Training DL

☆☆☆Training Deep Neural Networks

Learning Rate & 超参优化

[1706.00764] Hyperparameter Optimization: A Spectral Approach
2013 – Hinton：On the importance of initialization and momentum in deep learning
2017 – YellowFin and the Art of Momentum Tuning

鞍点，Local Minima，Generalization

Michael I. Jordan： Accelerated Gradient Descent Escapes Saddle Points Faster than Gradient Descent

认知、类脑计算

2016.5.1：《科学》封面论文作者力作：搭建像人一样思考和学习的机器（附论文下载）
2016-09-25：桥接认知科学和强化学习第1部分：生成论
行动是为了感知
Alva Noë在他书中写的很深入，不仅仅是论述感知是为了行动。他提出了更激进的主张，那就是行动是为了感知。他所说的是，如果没有行动，就没有真正意义上对世界的体验。这本书阐述了行动怎样让感知成为可能。Noë指出，我们用眼睛所看到的图像，仅仅提供给我们了真实世界很小一部分的信息，这些信息并不符合我们体验世界的方式。碗在我们的视网膜上留下了椭圆形的印象。那这个椭圆的形状是怎么被认为是一个碗呢？为了回答这个问题，Noë引用了感觉运动的技巧，它可以让我们理解真实世界相对于我们本身和环境的改变方式。这是因为我们可以绕着碗移动，也可以转动碗，所以我们能了解它是椭圆形的。通过建立感觉运动的技巧，我们可以给原本毫无意义的世界赋予相应的意义。

计算机视觉算法集合

Re-ID List：https://handong1587.github.io/deep_learning/2015/10/09/re-id.html
Object Detection List：https://handong1587.github.io/deep_learning/2015/10/09/object-detection.html
Generative Adversarial Networks List：https://handong1587.github.io/deep_learning/2015/10/09/gan.html

模型压缩

Acceleration and Model Compression List：https://handong1587.github.io/deep_learning/2015/10/09/acceleration-model-compression.html
是

AutoML