Find Jobs
Hire Freelancers

Multi-Critic DDPG Method and Double Experience Replay

₹12500-37500 INR

已关闭
已发布超过 1 年前

₹12500-37500 INR

货到付款
Abstract—The remarkable Deep Deterministic Policy Gradient (DDPG) reinforcement learning method commonly consists of actor learning and critic learning. The actor learning highly relies on the critic learning, which makes the performance of DDPG method rather sensitive to critic learning and leads to stability issues. To further improve the stability and performance of DDPG method, the multi-critic DDPG method (MCDDPG) is proposed for a reliable critic learning. The average value of multiple critics is used to replace the single critic in DDPG method for better resistance when one critic performs badly, and multiple independent critics can learn knowledges from environment more widely. Besides, an extension of experience replay mechanism is revealed for accelerating the training process. All the methods are tested on simulated environments in OpenAI Gym platform, and convincing experiment results should be obtained to support the proposed methods. (we can use python instead of OpenAI)
项目 ID: 35666847

关于此项目

6提案
远程项目
活跃1 年前

想赚点钱吗?

在Freelancer上竞价的好处

设定您的预算和时间范围
为您的工作获得报酬
简要概述您的提案
免费注册和竞标工作
6威客以平均价₹25,017 INR来参与此工作竞价
用户头像
Hello. Your projects seem interesting to me!! I have understood your issue. I am a well-experienced python developer so that I can complete this project. As you can see my profile, I have also rich experience with Image processing, AI and ML. I will do my best and your task will be first for me. Let’s discuss about more details in chat. Hope to get reply from you soon. Best regards.
₹25,000 INR 在7天之内
4.9 (26条评论)
5.3
5.3
用户头像
Greetings, I have gone through your project description. I find myself as a perfect fit for this job. I am working as a Python Developer from last 2 year. Some of my expertise is in the fields: 1. Web Scraping/Web Automation - Selenium, Scrapy, Requests, Beautifulsoup 2. AI and ML 3. Web Designing 4. Wordpress 5. Data Science 6. C/C++ 7. SQL I will be available 24/7 to assist you during the project. So lets discuss more about it over chat. Yours Faithfully, Jaibhan Singh Gaur,
₹12,500 INR 在3天之内
5.0 (61条评论)
5.1
5.1
用户头像
I would approach this project by first understanding the client's goals. I would then conduct research on the current methods used and relevant literature to gain an understanding of the issue. After that, I would design a program in Python to implement the proposed method. I would then test the program on simulated environments to ensure that it works as intended. Finally, I would present the results of the tests with a detailed report to the client.
₹25,000 INR 在7天之内
5.0 (4条评论)
4.8
4.8
用户头像
Hi! I'm 2 years experienced Machine Learning Engineer and Data Science Trainer ready to work for you in an efficient way with strong Data Science and Machine learning background. My Core Skills are: Machine learning - Classification, Regression, Association Rule Mining, Clustering & Dimensionality Reduction, Natural Language Processing, TensorFlow, Reinforcement learning, Reward-Based Models. Languages: Python, R, KNIME
₹37,600 INR 在1天之内
4.6 (3条评论)
3.4
3.4

关于客户

INDIA的国旗
Davangere, India
0.0
0
会员自12月 2, 2022起

客户认证

谢谢!我们已通过电子邮件向您发送了索取免费积分的链接。
发送电子邮件时出现问题。请再试一次。
已注册用户 发布工作总数
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
加载预览
授予地理位置权限。
您的登录会话已过期而且您已经登出,请再次登录。