"강화 학습"의 두 판 사이의 차이
		
		
		
		
		
		둘러보기로 가기
		검색하러 가기
		
				
		
		
	
Pythagoras0 (토론 | 기여)  (→노트:  새 문단)  | 
				Pythagoras0 (토론 | 기여)   | 
				||
| (같은 사용자의 중간 판 2개는 보이지 않습니다) | |||
| 3번째 줄: | 3번째 줄: | ||
# Reinforcement learning Supervised learning Reinforcement learning is all about making decisions sequentially.<ref name="ref_19fe">[https://www.geeksforgeeks.org/what-is-reinforcement-learning/ Reinforcement learning]</ref>  | # Reinforcement learning Supervised learning Reinforcement learning is all about making decisions sequentially.<ref name="ref_19fe">[https://www.geeksforgeeks.org/what-is-reinforcement-learning/ Reinforcement learning]</ref>  | ||
# In Reinforcement Learning (RL), agents are trained on a reward and punishment mechanism.<ref name="ref_abf4">[https://neptune.ai/blog/reinforcement-learning-applications 10 Real-Life Applications of Reinforcement Learning]</ref>  | # In Reinforcement Learning (RL), agents are trained on a reward and punishment mechanism.<ref name="ref_abf4">[https://neptune.ai/blog/reinforcement-learning-applications 10 Real-Life Applications of Reinforcement Learning]</ref>  | ||
| − | |||
# Using reinforcement learning, AlphaGo Zero was able to learn the game of Go from scratch.<ref name="ref_abf4" />  | # Using reinforcement learning, AlphaGo Zero was able to learn the game of Go from scratch.<ref name="ref_abf4" />  | ||
# In this paper, the authors propose real-time bidding with multi-agent reinforcement learning.<ref name="ref_abf4" />  | # In this paper, the authors propose real-time bidding with multi-agent reinforcement learning.<ref name="ref_abf4" />  | ||
| 37번째 줄: | 36번째 줄: | ||
===소스===  | ===소스===  | ||
  <references />  |   <references />  | ||
| + | |||
| + | ==메타데이터==  | ||
| + | ===위키데이터===  | ||
| + | * ID :  [https://www.wikidata.org/wiki/Q830687 Q830687]  | ||
| + | ===Spacy 패턴 목록===  | ||
| + | * [{'LOWER': 'reinforcement'}, {'LEMMA': 'learning'}]  | ||
2021년 2월 17일 (수) 00:30 기준 최신판
노트
- Reinforcement learning Supervised learning Reinforcement learning is all about making decisions sequentially.[1]
 - In Reinforcement Learning (RL), agents are trained on a reward and punishment mechanism.[2]
 - Using reinforcement learning, AlphaGo Zero was able to learn the game of Go from scratch.[2]
 - In this paper, the authors propose real-time bidding with multi-agent reinforcement learning.[2]
 - Reinforcement Learning is one of the hottest research topics currently and its popularity is only growing day by day.[3]
 - As compared to unsupervised learning, reinforcement learning is different in terms of goals.[3]
 - Reinforcement learning solves the difficult problem of correlating immediate actions with the delayed returns they produce.[4]
 - Thus, video games provide the sterile environment of the lab, where ideas about reinforcement learning can be tested.[4]
 - But convolutional networks derive different interpretations from images in reinforcement learning than in supervised learning.[4]
 - At the beginning of reinforcement learning, the neural network coefficients may be initialized stochastically, or randomly.[4]
 - This article pursues to highlight in a non-exhaustive manner the main type of algorithms used for reinforcement learning (RL).[5]
 - Reinforcement learning is the training of machine learning models to make a sequence of decisions.[6]
 - In reinforcement learning, an artificial intelligence faces a game-like situation.[6]
 - By leveraging the power of search and many trials, reinforcement learning is currently the most effective way to hint machine’s creativity.[6]
 - Training the models that control autonomous cars is an excellent example of a potential application of reinforcement learning.[6]
 - Reinforcement learning (RL) is learning by interacting with an environment.[7]
 - Reinforcement learning is also reflected at the level of neuronal sub-systems or even at the level of single neurons.[7]
 - However, only few dopaminergic neurons produce error signals that comply with the demands of reinforcement learning.[7]
 - In this post, we are gonna briefly go over the field of Reinforcement Learning (RL), from fundamental concepts to classic algorithms.[8]
 - After knowing these, it is pretty hard not to be curious about the magic behind these algorithms — Reinforcement Learning (RL).[8]
 - Dueling network architectures for deep reinforcement learning.[8]
 - You need to remember that Reinforcement Learning is computing-heavy and time-consuming.[9]
 - Thus, reinforcement learning is particularly well-suited to problems that include a long-term versus short-term reward trade-off.[10]
 - In inverse reinforcement learning (IRL), no reward function is given.[10]
 - This page contains Reinforcement Learning glossary terms.[11]
 - In reinforcement learning, the world that contains the agent and allows the agent to observe that world's state.[11]
 - In reinforcement learning, a policy that either follows a random policy with epsilon probability or a greedy policy otherwise.[11]
 - In reinforcement learning, a DQN technique used to reduce temporal correlations in training data.[11]
 - This paradigm of learning by trial-and-error, solely from rewards or punishments, is known as reinforcement learning (RL).[12]
 - Two years ago we introduced the first widely successful algorithm for deep reinforcement learning.[12]
 - In Reinforcement Learning, Richard Sutton and Andrew Barto provide a clear and simple account of the field's key ideas and algorithms.[13]
 - The final chapter discusses the future societal impacts of reinforcement learning.[13]
 - Reinforcement learning is the study of decision making over time with consequences.[14]
 
소스
- ↑ Reinforcement learning
 - ↑ 2.0 2.1 2.2 10 Real-Life Applications of Reinforcement Learning
 - ↑ 3.0 3.1 5 Things You Need to Know about Reinforcement Learning
 - ↑ 4.0 4.1 4.2 4.3 A Beginner's Guide to Deep Reinforcement Learning
 - ↑ Reinforcement Learning algorithms — an intuitive overview
 - ↑ 6.0 6.1 6.2 6.3 What is reinforcement learning? The complete guide
 - ↑ 7.0 7.1 7.2 Reinforcement learning
 - ↑ 8.0 8.1 8.2 A (Long) Peek into Reinforcement Learning
 - ↑ Reinforcement Learning: What is, Algorithms, Applications, Example
 - ↑ 10.0 10.1 Reinforcement learning
 - ↑ 11.0 11.1 11.2 11.3 Machine Learning Glossary: Reinforcement Learning
 - ↑ 12.0 12.1 Deep Reinforcement Learning
 - ↑ 13.0 13.1 Reinforcement Learning, Second Edition
 - ↑ Reinforcement Learning
 
메타데이터
위키데이터
- ID : Q830687
 
Spacy 패턴 목록
- [{'LOWER': 'reinforcement'}, {'LEMMA': 'learning'}]