Proofs of Reinforcement Learning Algorithms

Hi,

I'm proposing a new reinforcement learning algorithm, applied in my
case to robots. Please see:
http://www.compactech.com/kartoun/articles/html/Kartoun_RA_2005_September_3_2005_Accepted.htm

I would like to describe the algorithm more rigorously and define it
mathematically much better than it is described in the paper.

I'm asking for guidance on how to prove properties of an algorithm, for
example convergence or superiority. How can I demonstrate the
advantages or disadvantages of an algorithm mathematically? How can I
prove convergence or divergence? How can I show whether it is better or
worse than other algorithms?

I've already tested the algorithm on a mobile robot for navigation,
please see:

http://www.compactech.com/kartoun/videos/Uri_Kartoun_CQ (lamda)_Reinforcment_Learning_August_7_2005.wmv

I intend to apply it to the task of finding optimal grasping, lifting
and shaking policies for suspicious bags (containing anthrax, Ebola
microbes or SARS). Please see an initial experiment:

http://www.compactech.com/kartoun/videos/Uri_Kartoun_Plastic_Bag_Experiment_January_2_2006.wmv

Thanks a lot!

Uri Kartoun.

http://www.compactech.com/kartoun/


Re: Proofs of Reinforcement Learning Algorithms


This question would be better directed to comp.ai.

Chris

Re: Proofs of Reinforcement Learning Algorithms

Thanks Chris. I just submitted it to comp.ai.
Uri.


Re: Proofs of Reinforcement Learning Algorithms

No, it would not be better directed there.
Robots, once given motors and sensors, need intelligence.

What would be helpful is a plain-English translation for the
mathematically stunted like myself.






Re: Proofs of Reinforcement Learning Algorithms

blueeyedpop, I'm going to improve the algorithm and explain it better
in papers in the near future.
Thanks,
-U.


Re: Proofs of Reinforcement Learning Algorithms

This is also related:

http://groups.google.com/group/comp.ai.games/browse_thread/thread/8198fde98af821af/54e2842be5d17610?lnk=st&q=kartoun&rnum=2#54e2842be5d17610

Thanks,

Uri.


Re: Proofs of Reinforcement Learning Algorithms


Robots undoubtedly require intelligence. However, in the several years
I've followed this forum, I can count the number of substantive AI posts
on one hand. That's not to say we aren't interested in AI. This forum
just seems more geared towards the mechanics of robotics.

Chris

Re: Proofs of Reinforcement Learning Algorithms

CRM is just that, miscellaneous robotics.

The profusion of mechanical posts in the group stems from the availability
of outside resources dedicated to processors and such.

Back in the day, there were a lot of posts, at a lot of different levels.

If Kartoun could digest his data into bits that math morons like myself can
grasp, it would be a tremendous contribution to this group.

Mike

Why the group is so slow compared to its heyday of a decade ago has been
a subject of concern.


Re: Proofs of Reinforcement Learning Algorithms

O.K., I believe that I should mostly post messages related to mechanics.
Like this:

http://www.compactech.com/kartoun/videos/Uri_Kartoun_ER1_Self_Docking_November_2004.wmv

Uri.


Re: Proofs of Reinforcement Learning Algorithms

NOOOOOOOOOOOOOOOOO!
See the FAQ:

http://www.faqs.org/faqs/robotics-faq/part1/




Re: Proofs of Reinforcement Learning Algorithms



http://www.compactech.com/kartoun/videos/Uri_Kartoun_ER1_Self_Docking_November_2004.wmv

Too long... better to summarize it...


Re: Proofs of Reinforcement Learning Algorithms

CRM is for anything robotic. Please keep posting anything remotely
interesting, unless it is spam.




Re: Proofs of Reinforcement Learning Algorithms

That's what I'm trying to do.
Uri.

