Skip to content
Yair Shinar
Leadership and Software
Primary Navigation Menu
Menu
  • Home
  • Portfolio
    • Live Projects
    • Indoor Navigation in Mall
    • Indoor Navigation in Building
    • Race Track Localization
    • Planting Commercial on Buildings
    • Navigation & Motion Estimation for Outdoor
    • Various
  • Videos – Instructional
  • Algorithm & Other Snippets
  • Programming Solutions
    • Algorithms
    • Autonomous Vehicles
    • AI
    • Database
    • Science
    • Back End Server Side/ Desktop
    • Operating System
    • Front End Web Development
    • Security
    • Cloud
  • Skills
    • Management
    • Sciences Proficiency
      • Mathematics
      • Physics
      • Computer Science
      • Statistics
    • Work Experience
      • Data Scientist
      • Computer Vision
      • Autonomous
      • Algorithm
      • Python
  • CV
  • Mini-Projects
  • Management

Search

Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors

reward

Q-learning – Model-free reinforcement learning algorithm

2019-07-06
On: July 6, 2019
In: AI, Algorithms, Deep Machine Learning, Reinforcement Learning

Learns a policy which tells an agent what action to take under what circumstances. Q-learning learns a policy that is optimal in the sense that it maximizes the expected value of the total reward over any and all successive steps, starting from the current state.Continue Reading

Markov decision process (MDP) – Reinforcement Learning decision model

2019-07-06
On: July 6, 2019
In: AI, Deep Machine Learning, Reinforcement Learning

* Is a discrete time stochastic control process for decision making in situations where outcomes are partly random and partly under the control of a decision maker. * At each discrete time step, the process is in some state s, and the decision maker may choose any action a thatContinue Reading

Yair Shinar for Clarity and Solutions

Scroll Up