Self-driving car env with PPO algorithm from stable baseline3

Last update: Dec 22, 2022

Related tags

Deep Learning Self-Driving-car

Overview

Self-driving car with RL stable baseline3

Most of the project develop from https://github.com/GerardMaggiolino/Gym-Medium-Post Please check it out!

This project focus on training self-driving car env by implementing PPO algorithm from stable baseline3

Installation

Clone the project

git clone https://github.com/SornsiriP/Self-Driving-car

Then run Gym-Medium-Post/main.py

Update

Wrap env to change observation space from box to RGB image

from simple_driving.resources.wrapper import ProcessFrame84

env = ProcessFrame84(env)

Using PPO with CNN policy instead of TRPO

from stable_baselines3 import PPO

model = PPO('CnnPolicy', env, verbose=1,learning_rate = 0.00025,tensorboard_log="./Simple-driving/",n_steps=10000,batch_size=1000,gamma=0.9995)
model.learn(total_timesteps=150000)

Normalize action space

def map_action(self, action):
  speed_range = [0,1]
  steer_range = [-0.6,0.6]
  new_speed = np.interp(action[0],[-1,1],speed_range)
  new_steer = np.interp(action[0],[-1,1],steer_range)
  return [new_speed, new_steer]

Add limited timestep reset condition

if self.current_step >1000:
  self.current_step = 0
  self.done = True

Normalize distance in reward function

previous_dist_to_goal = np.linalg.norm(tuple(map(lambda i, j: i - j, self.goal, self.prev_pos)))
current_dist_to_goal =  np.linalg.norm(tuple(map(lambda i, j: i - j, self.goal, car_ob[0:2])))

Reference

https://github.com/GerardMaggiolino/Gym-Medium-Post

https://www.etedal.net/2020/04/pybullet-panda_3.html

Contributing

Sornsiri Promma

Thanks original project from Gerard Maggiolino

Please make sure to update tests as appropriate.

Self-driving car env with PPO algorithm from stable baseline3

Related tags

Overview

Self-driving car with RL stable baseline3

Installation

Update

Reference

Contributing

Owner

Sornsiri.P

A set of tools to pre-calibrate and calibrate (multi-focus) plenoptic cameras (e.g., a Raytrix R12) based on the libpleno.

Multi-Scale Aligned Distillation for Low-Resolution Detection (CVPR2021)

Repository for the Bias Benchmark for QA dataset.

RaftMLP: How Much Can Be Done Without Attention and with Less Spatial Locality?

Abstractive opinion summarization system (SelSum) and the largest dataset of Amazon product summaries (AmaSum). EMNLP 2021 conference paper.

Implementation for "Conditional entropy minimization principle for learning domain invariant representation features"

Source code and notebooks to reproduce experiments and benchmarks on Bias Faces in the Wild (BFW).

Type4Py: Deep Similarity Learning-Based Type Inference for Python

The code for the NeurIPS 2021 paper "A Unified View of cGANs with and without Classifiers".

Deep Hedging Demo - An Example of Using Machine Learning for Derivative Pricing.

Sparse-dense operators implementation for Paddle

To build a regression model to predict the concrete compressive strength based on the different features in the training data.

Official Pytorch implementation of 6DRepNet: 6D Rotation representation for unconstrained head pose estimation.

[ICCV 2021 Oral] SnowflakeNet: Point Cloud Completion by Snowflake Point Deconvolution with Skip-Transformer

Vertex AI: Serverless framework for MLOPs (ESP / ENG)

frida工具的缝合怪

Continuum Learning with GEM: Gradient Episodic Memory

Code and data for "TURL: Table Understanding through Representation Learning"

This computer program provides a reference implementation of Lagrangian Monte Carlo in metric induced by the Monge patch

This repository contains the code for TABS, a 3D CNN-Transformer hybrid automated brain tissue segmentation algorithm using T1w structural MRI scans