A PyTorch Implementation of Single Shot Scale-invariant Face Detector.

Last update: Jan 07, 2023

Overview

S³FD: Single Shot Scale-invariant Face Detector

Eval

python wider_eval_pytorch.py

cd eval/eval_tools_old-version
octave wider_eval_pytorch.m

Model

s3fd_convert.7z

Test

python test.py --model data/s3fd_convert.pth --path data/test01.jpg

References

SFD

Comments

RGB <-> BGR

From this line, I assume you use RGB: img = img - np.array([104,117,123])

However, OpenCV uses BGR, so this line returns a BGR image: if args.path=='CAMERA': ret, img = cap.read()

The BGR image is then fed to the network: bboxlist = detect(net,img)

I fed RGB to the network and got worse results. Is it possible that you meant RGB in all places but the network is actually trained for BGR? (If so, the line should be img = img - np.array([123,117,104]).)
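The channel-order question above can be illustrated with a small sketch. It assumes the values (104, 117, 123) are per-channel means listed in B, G, R order, which is the common Caffe convention; whether this repository's weights expect that order is not confirmed here. The names MEAN_BGR, subtract_mean and rgb_to_bgr are hypothetical helpers, not part of test.py.

```python
import numpy as np

# Assumption: per-channel means in B, G, R order (common Caffe convention).
MEAN_BGR = np.array([104.0, 117.0, 123.0], dtype=np.float32)


def subtract_mean(img, channel_order="BGR"):
    """Subtract the per-channel mean from an HxWx3 image stored in channel_order."""
    if channel_order not in ("BGR", "RGB"):
        raise ValueError("channel_order must be 'BGR' or 'RGB'")
    # Reverse the mean vector when the image is stored as RGB.
    mean = MEAN_BGR if channel_order == "BGR" else MEAN_BGR[::-1]
    return img.astype(np.float32) - mean


def rgb_to_bgr(img):
    """Reverse the last axis to swap between RGB and BGR."""
    return img[..., ::-1]


# A single pure-red pixel, stored as RGB.
rgb = np.zeros((1, 1, 3), dtype=np.uint8)
rgb[0, 0] = (255, 0, 0)
bgr = rgb_to_bgr(rgb)

centered_bgr = subtract_mean(bgr, "BGR")
centered_rgb = subtract_mean(rgb, "RGB")

# Both paths agree once the channel order is accounted for.
print(np.allclose(rgb_to_bgr(centered_bgr), centered_rgb))
```

The point of the sketch is only that the mean vector and the image must use the same channel order; feeding an RGB image together with a BGR-ordered mean shifts the red and blue channels by the wrong amounts.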

opened by elbaro 3
How to Convert Weights

Dear @clcarwin, Thank you for your nice work. Would you please tell me how you converted the Caffe model and weights of S3FD to PyTorch? Could you also convert the model and pre-trained weights of RefineDet to PyTorch?

opened by ahkarami 2
Evaluation accuracy is not as good as the original paper

Hi @clcarwin,

I tested your model on WIDER FACE and got (easy 92.8, medium 91.5, hard 84.2). But with the original model provided by sfzhang15/SFD, I get (easy 93.8, medium 92.4, hard 85.1).

Did I test correctly? If so, why is there an accuracy loss?

Great work! Best,

opened by marvis 2
'float' object cannot be interpreted as an integer??

Sir, I'm sorry to bother you about this. I ran this project on Windows 10, Python 3.5.2, PyTorch 0.3. After running python test.py --model data/s3fd_convert.pth --path data/test01.jpg, the screen displays:

D:\Python\Pytorch_cw_sfd\SFD_pytorch>python test.py --model data/s3fd_convert.pth --path data/test01.jpg
Traceback (most recent call last):
  File "test.py", line 71, in <module>
    bboxlist = detect(net,img)
  File "test.py", line 27, in detect
    for i in range(len(olist)/2): olist[i*2] = F.softmax(olist[i*2])
TypeError: 'float' object cannot be interpreted as an integer

Why ???
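The TypeError in the traceback above is consistent with Python 3 semantics: the / operator always returns a float, and range() accepts only integers. Below is a minimal sketch of the likely fix, assuming the loop is meant to visit every second entry of olist. The list contents are placeholder strings, not real network outputs.

```python
# Placeholder stand-in for the network outputs; assumed to alternate
# (classification, regression) entries, as the loop in the traceback suggests.
olist = ["cls0", "reg0", "cls1", "reg1", "cls2", "reg2"]

# Python 3: len(olist) / 2 is the float 3.0, which range() rejects.
try:
    range(len(olist) / 2)
    error_message = None
except TypeError as exc:
    error_message = str(exc)

# Floor division keeps the result an int, so the loop runs as intended.
num_pairs = len(olist) // 2
cls_entries = [olist[i * 2] for i in range(num_pairs)]

print(error_message)
print(cls_entries)
```

Under this assumption, changing range(len(olist)/2) to range(len(olist)//2) in detect should remove the error on Python 3 while keeping the Python 2 behavior.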

opened by door5719 1
padding size of fc6

Hi @clcarwin,

Why do you set the padding size of fc6 to 3? This is inconsistent with the original paper. See https://github.com/clcarwin/SFD_pytorch/blob/master/net_s3fd.py#L42

Best,

opened by marvis 1
Optimization

Good: It is accurate.

Bad: The inference time is more than 80 ms, which is too slow for real-time use. To run in real time, the image has to be resized to less than 200x200, which reduces accuracy.

So the only way to make it usable is to make it faster. Have you tried using TensorRT, TVM, or PyTorch serving in C++?

opened by jamessmith90 0
Several speed & code updates

It seems nobody is looking at PRs here, but I'm letting others know that I've made a number of improvements.

The code now runs smoothly on modern PyTorch (1.3), and I refactored it to eliminate redundancy. I also added some convenience methods that make common tasks easier, like detect_faces, as well as integration tests.

I independently found the same speed-up as @kir-dan in https://github.com/clcarwin/SFD_pytorch/pull/4 and moved all that code from NumPy into PyTorch, so it can run fully on the GPU.

opened by leopd 0
Very high GPU memory usage

Hi, I have been running the model using test.py, which I modified to run on multiple files. The GPU memory usage keeps increasing, from 3 GB to 9 GB. Is this due to poor garbage collection?

opened by vaishnavm217 2
Change Anchor Boxes Aspect Ratio

Dear @clcarwin, If one wants to change the aspect ratio of the anchor boxes, is it enough to change the detect method in test.py (for example, the line https://github.com/clcarwin/SFD_pytorch/blob/96fdfbe22eef176a04802d915834b82a131a854d/test.py#L39), or must other methods be changed as well?

opened by ahkarami 0
About data augmentation

When I use TensorFlow to build the project, I have some trouble with the data augmentation described in the paper. Can you give the details of the data augmentation or show me your data augmentation code? Thank you.

opened by ckqsars 0

Releases(v0.1)

v0.1(Nov 21, 2017)

Source code(tar.gz)
Source code(zip)
s3fd_convert.7z(8.14 MB)

Owner

carwin

GitHub Repository

Code for the paper: On Pathologies in KL-Regularized Reinforcement Learning from Expert Demonstrations

Non-Parametric Prior Actor-Critic (N-PPAC) This repository contains the code for On Pathologies in KL-Regularized Reinforcement Learning from Expert D

5 May 13, 2022

FFCV: Fast Forward Computer Vision (and other ML workloads!)

Fast Forward Computer Vision: train models at a fraction of the cost with accele

2.3k Jan 03, 2023

Image-Adaptive YOLO for Object Detection in Adverse Weather Conditions

Image-Adaptive YOLO for Object Detection in Adverse Weather Conditions Accepted by AAAI 2022 [arxiv] Wenyu Liu, Gaofeng Ren, Runsheng Yu, Shi Guo, Jia

245 Dec 16, 2022

X-VLM: Multi-Grained Vision Language Pre-Training

X-VLM: learning multi-grained vision language alignments Multi-Grained Vision Language Pre-Training: Aligning Texts with Visual Concepts. Yan Zeng, Xi

286 Dec 23, 2022

Apply AnimeGAN-v2 across frames of a video clip

title emoji colorFrom colorTo sdk app_file pinned AnimeGAN-v2 For Videos 🔥 blue red gradio app.py false AnimeGAN-v2 For Videos Apply AnimeGAN-v2 acro

36 Oct 18, 2022

Pgn2tex - Scripts to convert pgn files to latex document. Useful to build books or pdf from pgn studies

Pgn2Latex (WIP) A simple script to make pdf from pgn files and studies. It's sti

12 Jul 23, 2022

Distributed Asynchronous Hyperparameter Optimization in Python

Hyperopt: Distributed Hyperparameter Optimization Hyperopt is a Python library for serial and parallel optimization over awkward search spaces, which

6.5k Jan 01, 2023

simple_pytorch_example project is a toy example of a python script that instantiates and trains a PyTorch neural network on the FashionMNIST dataset

1 Jan 07, 2022

A Python package to create, run, and post-process MODFLOW-based models.

Version 3.3.5 — release candidate Introduction FloPy includes support for MODFLOW 6, MODFLOW-2005, MODFLOW-NWT, MODFLOW-USG, and MODFLOW-2000. Other s

388 Nov 29, 2022

Self-Supervised Learning with Kernel Dependence Maximization

Self-Supervised Learning with Kernel Dependence Maximization This is the code for SSL-HSIC, a self-supervised learning loss proposed in the paper Self

29 Dec 29, 2022

Global-Local Path Networks for Monocular Depth Estimation with Vertical CutDepth [Paper]

Global-Local Path Networks for Monocular Depth Estimation with Vertical CutDepth [Paper] Downloads [Downloads] Trained ckpt files for NYU Depth V2 and

98 Jan 01, 2023

Relative Uncertainty Learning for Facial Expression Recognition

Relative Uncertainty Learning for Facial Expression Recognition The official implementation of the following paper at NeurIPS2021: Title: Relative Unc

35 Dec 28, 2022

Gans-in-action - Companion repository to GANs in Action: Deep learning with Generative Adversarial Networks

GANs in Action by Jakub Langr and Vladimir Bok List of available code: Chapter 2: Colab, Notebook Chapter 3: Notebook Chapter 4: Notebook Chapter 6: C

914 Dec 21, 2022

Cockpit is a visual and statistical debugger specifically designed for deep learning.

Cockpit: A Practical Debugging Tool for Training Deep Neural Networks

421 Dec 29, 2022

Investigating automatic navigation towards standard US views integrating MARL with the virtual US environment developed in CT2US simulation

AutomaticUSnavigation Investigating automatic navigation towards standard US views integrating MARL with the virtual US environment developed in CT2US

6 Dec 05, 2022

DyNet: The Dynamic Neural Network Toolkit

Evaluation suite for large-scale language models.

SpeechBrain is an open-source and all-in-one speech toolkit based on PyTorch.

Real-time pose estimation accelerated with NVIDIA TensorRT

Thermal Control of Laser Powder Bed Fusion using Deep Reinforcement Learning