Tensorflow(@CVision)

احتمالا قبلا در مورد هوش مصنوعی ای که بازی شطرنج بازی میکرد و اکثر بازیکنان حرفه ای شطرنج را برده شنیده اید! اما حقیقت این است که آن هوش مصنوعی تنها قادر بود شطرنج بازی کند و در واقع فقط برای آن بازی آموزش دیده بود و توانایی هیچ کار دیگری نداشت!

محققان گوگل در مقاله ی سال 2015 نیچر با استفاده از ایده ی #یادگیری_تقویتی شبکه عصبی طراحی کردند که قادر بود که خودش بازی ها را یاد بگیرد!
این هوش مصنوعی که به عنوان ورودی صفحه RGB بازی و به عنوان پاداش امتیاز کسب شده از بازی را دریافت میکرد، با 50 بازی آتاری تست شد و نتایج اعجاب انگیزی داشت!

نکته ی جالب این بود یک ساختار شبکه، توانسته بود خودش هر یک از بازی های آتاری را بیاموزد و در بسیاری از بازی ها از بهترین بازیکن ها ی این بازی ها بهتر بازی کند.

ویدیو نتایج بازی منتشر شده توسط google deepmind:
https://youtu.be/TmPfTpjtdgg
یا
http://deepnn.ir/tensorflow-telegram-files/DQN%20Breakout.mp4

با استفاده از ایده‌ی #یادگیری_تقویتی، نیاز به داده هایی که انسان در آن دست برده کم تر شده و مدل یا ربات می‌تواند خودش از دنیای پیرامون اطلاعات کسب کند. در این نوع یادگیری در نهایت تنها پاداش به مدل داده می‌شود.

#reinforcement_learning #rl #deepmind #atari

YouTube

DQN Breakout

This video illustrates the improvement in the performance of DQN over training (i.e. after 100, 200, 400 and 600 episodes). After 600 episodes DQN finds and exploits the optimal strategy in this game, which is to make a tunnel around the side, and then allow…

2.6K views18:52

Tensorflow(@CVision)

منابع مفید برای شروع #یادگیری_تقویتی
#مسیر_یادگیری، #منابع، #کورس

⭕️Resources for #Reinforcement_Learning:

Wikipedia Article for General Info on Reinforcement Learning

https://stackoverflow.com/questions/43288550/iopub-data-rate-exceeded-when-viewing-image-in-jupyter-notebook

Simpler Reinforcement Learning API with #Keras. It's a lot easier to play with this than manually implement techniques in #TensorFlow.

https://github.com/matthiasplappert/keras-rl

Huge Repo of Implemented RL Algos:

https://github.com/dennybritz/reinforcement-learning

#Berkeley #Course on RL and DL:

http://rll.berkeley.edu/deeprlcourse/

Nice Series of Articles on RL:

https://medium.com/emergent-future/simple-reinforcement-learning-with-tensorflow-part-0-q-learning-with-tables-and-neural-networks-d195264329d0

Some great books on RL:

* Richard S. Sutton and Andrew G. Barto. Reinforcement Learning.
* Stuart J. Russell and Peter Norvig. Artificial Intelligence. A Modern Approach.
* Ian Goodfellow, Yoshua Bengio, and Aaron Courville. Deep Learning.

Stack Overflow

IOPub data rate exceeded in Jupyter notebook (when viewing image)

I want to view an image in Jupyter notebook. It's a 9.9MB .png file.

from IPython.display import Image
Image(filename='path_to_image/image.png')
I get the below error:

IOPub data rate exceeded.
...

4.3K viewsAlireza Akhavan, edited 13:01

About

Blog

Apps

Platform