Guiding evolutionary strategies by (inaccurate) differentiable robot simulators @ NeurIPS, 4th Robot Learning Workshop

Last update: Dec 14, 2021

Overview

Guiding Evolutionary Strategies by Differentiable Robot Simulators

In recent years, Evolutionary Strategies have been actively explored for policy search in robotic tasks, as they provide a simpler alternative to reinforcement learning algorithms. However, this class of algorithms is often claimed to be extremely sample-inefficient. On the other hand, there is growing interest in Differentiable Robot Simulators (DRS), as they can potentially find successful policies with only a handful of trajectories. However, the resulting gradient is not always useful for first-order optimization. In this work, we demonstrate how the DRS gradient can be used in conjunction with Evolutionary Strategies. Preliminary results suggest that this combination can reduce the sample complexity of Evolutionary Strategies by 3x-5x in both simulation and the real world.
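The overview does not spell out how the simulator gradient is combined with Evolutionary Strategies. One published scheme for this kind of combination is Guided ES (Maheswaranathan et al., 2019), which biases the ES search distribution toward a surrogate gradient. The sketch below illustrates that idea under stated assumptions and is not necessarily the procedure used in the paper. The function name `guided_es_gradient`, its parameters (`sigma`, `alpha`, `beta`, `pairs`), and the toy loss are all hypothetical, and a single surrogate gradient (a one-dimensional guiding subspace) is assumed.

```python
import math
import random


def guided_es_gradient(f, x, surrogate_grad, sigma=0.1, alpha=0.5,
                       beta=1.0, pairs=10, rng=random):
    """Antithetic ES gradient estimate biased toward a surrogate gradient.

    f              -- black-box loss (e.g. a real-world rollout cost); hypothetical
    x              -- current parameters as a list of floats
    surrogate_grad -- possibly inaccurate gradient, e.g. from a differentiable simulator
    alpha          -- 1.0 means plain isotropic ES; smaller values trust the surrogate more
    """
    n = len(x)
    norm = math.sqrt(sum(g * g for g in surrogate_grad))
    if norm > 0.0:
        u = [g / norm for g in surrogate_grad]  # unit guiding direction
    else:
        u = [0.0] * n   # no usable surrogate: fall back to isotropic ES
        alpha = 1.0

    estimate = [0.0] * n
    for _ in range(pairs):
        z_sub = rng.gauss(0.0, 1.0)  # one scalar draw along the guiding direction
        eps = [
            sigma * (math.sqrt(alpha / n) * rng.gauss(0.0, 1.0)
                     + math.sqrt(1.0 - alpha) * z_sub * u[i])
            for i in range(n)
        ]
        # Antithetic pair: evaluate the loss at x + eps and x - eps.
        f_plus = f([xi + e for xi, e in zip(x, eps)])
        f_minus = f([xi - e for xi, e in zip(x, eps)])
        diff = f_plus - f_minus
        for i in range(n):
            estimate[i] += eps[i] * diff

    scale = beta / (2.0 * sigma ** 2 * pairs)
    return [scale * e for e in estimate]


def toy_loss(x):
    """Stand-in for an expensive rollout cost (hypothetical)."""
    return sum(v * v for v in x)


def biased_sim_grad(x):
    """Stand-in for an inaccurate simulator gradient: true gradient plus a bias."""
    return [2.0 * v + 0.5 for v in x]
```

A descent loop would call `guided_es_gradient(toy_loss, x, biased_sim_grad(x))` at each step and move `x` against the returned estimate. Each step costs `2 * pairs` evaluations of `f`, so any sample-efficiency gain has to come from needing fewer pairs or fewer steps than isotropic ES.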

To appear in the 4th Robot Learning Workshop: Self-Supervised and Lifelong Learning

Paper -- Video -- Poster

Citation

Please use the following BibTeX entry:

@misc{kurenkov2021guiding,
      title={Guiding Evolutionary Strategies by Differentiable Robot Simulators}, 
      author={Vladislav Kurenkov and Bulat Maksudov},
      year={2021},
      eprint={2110.00438},
      archivePrefix={arXiv},
      primaryClass={cs.RO}
}


Owner

Vladislav Kurenkov
