Reinforcement Learning with Stable-Baselines and SAC algorithm

I would like to have simple a custom RL agent using SAC from Stable-baselines.

Tasks are as follow:

1. Design a custom environment (I will help with the details)

2. Train the agent and save the policy

3. Load the saved policy and test the agent to get the actions

4. You should use the Stable-baselines callback to save the best model

The project is considered "completed" if and only if all 4 tasks have been completed.

Aptitudini: Python, Machine Learning (ML)

Despre angajator:
( 3 recenzii ) Suwon, Korea, Republic of

ID Proiect: #32764364

8 freelanceri licitează în medie 166$ pentru acest proiect

(104 recenzii)

☀️☀️☀️☀️☀️ Hello , I hope you are safe and Doing well I have seen your project requirements , I am looking to discuss further with you Hope we will meet soon to discuss further ☀️☀️☀️☀️☀️ ⚡️⚡️ Coming to me, I'm a Data Mai multe

%bids___i_sum_sub_35%%project_currencyDetails_sign_sub_36% USD în 7 zile
(48 recenzii)

Greetings. I checked your Machine Learning and Datamining project requirements. Your work is one of the tasks that can be done very perfectly by us. I will work within your budget, Within your deadline. We are good in Mai multe

%bids___i_sum_sub_35%%project_currencyDetails_sign_sub_36% USD în 2 zile
(20 recenzii)

Hi, I have +5 years of experience dealing with machine learning algorithms and worked on multiple projects in this field, I absolutely can do your project as you like. Please contact me to discuss more. Have a nice day

%bids___i_sum_sub_35%%project_currencyDetails_sign_sub_36% USD în 7 zile
(10 recenzii)
(4 recenzii)
(3 recenzii)

Hello client I am a professional Python developer with 7+ years of experience in Python such as web scraping, bot, Flask , Django and Machine Learning, NLP, Deep Learning, Artificial Intelligence , Tensor Overflow, CNN Mai multe

%bids___i_sum_sub_35%%project_currencyDetails_sign_sub_36% USD în 7 zile
(1 părere)

Hi, I'm a senior PhD student studying (Deep) Reinforcement Learning algorithms and applications. I've been creating my own custom environments and use stable-baselines as well for most of my experiments and papers, so Mai multe

%bids___i_sum_sub_35%%project_currencyDetails_sign_sub_36% USD în 2 zile
(0 recenzii)