The technology developed in this invention, Fitted Q-Iteration with Complex Returns (CFQI), enables an autonomous system to learn how to perform a complex task from past experience

About

The technology developed in this invention, Fitted Q-Iteration with Complex Returns (CFQI), enables an autonomous system to learn how to perform a complex task from past experiences through reinforcement learning – learning from feedback without human intervention. Successful application of this technology will not only make it easier and faster to build general-purpose autonomous systems, but also enable an autonomous system to continuously improve its performance and adapt to new and dynamic environments.

Key Benefits

• Less dependent on human intervention (e.g., anticipating the operational environment and hard coding rules of operations and learning) in teaching a system how to perform a task, and therefore, less human cost, and faster to develop a system. • Ability to adapt to new, uncertain, dynamic environment without human intervention (e.g., specifically instructed by a human expert to change its course of actions under a new condition) • Forever learning capability - the longer a system is deployed, the more experience it gets, and the better it performs. • Better sample efficiency than FQI – achieving the same level of policy performance using significantly less samples. • Better computational efficiency than FQI – achieving the same level of policy performance with significantly less computation time. • Better effectiveness than TFQI (an earlier extension to FQI) – much more reliable in producing improved policy performance over FQI given the same set of samples.

Register for free for full unlimited access to all innovation profiles on LEO

  • Discover articles from some of the world’s brightest minds, or share your thoughts and add one yourself
  • Connect with like-minded individuals and forge valuable relationships and collaboration partners
  • Innovate together, promote your expertise, or showcase your innovations