PySurvival is an open source python package for Survival Analysis modeling

Last update: Dec 27, 2022

Overview

PySurvival

What is PySurvival?

PySurvival is an open source Python package for Survival Analysis modeling, the modeling concept used to analyze or predict when an event is likely to happen. It is built upon the most commonly used machine learning packages, such as NumPy, SciPy and PyTorch.
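To make "when an event is likely to happen" concrete: survival data pairs an observed duration T with an event indicator E (1 if the event was observed, 0 if the record was censored). The sketch below is a plain-Python Kaplan-Meier estimator of the survival curve on toy data. It is an illustration only and does not use the PySurvival API; the helper name `kaplan_meier` and the toy values are assumptions made for this example.

```python
def kaplan_meier(times, events):
    """Return [(t, S(t))] at each distinct time where an event occurred.

    times  : observed durations
    events : 1 if the event was observed, 0 if the record was censored
    (hypothetical helper for illustration, not part of PySurvival)
    """
    pairs = sorted(zip(times, events))
    n_at_risk = len(pairs)
    survival = 1.0
    curve = []
    i = 0
    while i < len(pairs):
        t = pairs[i][0]
        n_events = 0
        n_removed = 0
        # Group all records that share the same observed time t
        while i < len(pairs) and pairs[i][0] == t:
            n_events += pairs[i][1]
            n_removed += 1
            i += 1
        if n_events > 0:
            # Multiply by the fraction of at-risk records that survive past t
            survival *= 1.0 - n_events / float(n_at_risk)
            curve.append((t, survival))
        # Both events and censored records leave the risk set
        n_at_risk -= n_removed
    return curve


# Toy data: five records, two of them censored (E = 0)
T = [2, 3, 3, 5, 8]
E = [1, 1, 0, 1, 0]
km_curve = kaplan_meier(T, E)
for t, s in km_curve:
    print("S({}) = {:.2f}".format(t, s))
```

Censored records still count in the risk set until their observed time, which is what distinguishes survival analysis from simply averaging the observed durations.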

PySurvival is compatible with Python 2.7-3.7.

Check out the documentation here

Content

PySurvival makes it easy to move between the theory of Survival Analysis and detailed tutorials on how to conduct a full analysis and build and use a model. The package contains:

10+ models, ranging from the Cox Proportional Hazard model and the Neural Multi-Task Logistic Regression to the Random Survival Forest
Summaries of the theory behind each model, as well as API descriptions and examples
Tutorials showing in great detail how to perform exploratory data analysis, survival modeling, cross-validation and prediction, for churn modeling and credit risk, to name a few
Performance metrics to assess the models' abilities, such as the C-index or the Brier score
Simple ways to load and save models
... and more!

Installation

If you have already installed a working version of gcc, the easiest way to install PySurvival is with pip:

pip install pysurvival

The full description of the installation steps can be found here.

Get Started

Thanks to its simple API, PySurvival is built to provide the best user experience when it comes to modeling. Here's a quick modeling example to get you started:

# Loading the modules
from pysurvival.models.semi_parametric import CoxPHModel
from pysurvival.models.multi_task import LinearMultiTaskModel
from pysurvival.datasets import Dataset
from pysurvival.utils.metrics import concordance_index

# Loading and splitting a simple example into train/test sets
X_train, T_train, E_train, X_test, T_test, E_test = \
    Dataset('simple_example').load_train_test()

# Building a CoxPH model
coxph_model = CoxPHModel()
coxph_model.fit(X=X_train, T=T_train, E=E_train, init_method='he_uniform',
                l2_reg=1e-4, lr=.4, tol=1e-4)

# Building a MTLR model
mtlr = LinearMultiTaskModel()
mtlr.fit(X=X_train, T=T_train, E=E_train, init_method='glorot_uniform',
         optimizer='adam', lr=8e-4)

# Checking the model performance
c_index1 = concordance_index(model=coxph_model, X=X_test, T=T_test, E=E_test)
print("CoxPH model c-index = {:.2f}".format(c_index1))

c_index2 = concordance_index(model=mtlr, X=X_test, T=T_test, E=E_test)
print("MTLR model c-index = {:.2f}".format(c_index2))
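For readers who want to know what the c-index printed above measures, here is a minimal plain-Python sketch of Harrell's concordance index: the fraction of comparable pairs in which the record with the earlier observed event also has the higher predicted risk. This is not PySurvival's implementation, which may handle ties and censoring differently; the function name `harrell_c_index`, its arguments and the toy values are assumptions made for this illustration.

```python
def harrell_c_index(risk, times, events):
    """Harrell's C: share of comparable pairs ranked correctly by risk.

    risk   : predicted risk scores (higher means the event is expected sooner)
    times  : observed durations
    events : 1 if the event was observed, 0 if censored
    (hypothetical helper for illustration, not part of PySurvival)
    """
    concordant = 0.0
    comparable = 0
    n = len(times)
    for i in range(n):
        if not events[i]:
            # A censored record cannot serve as the earlier event of a pair
            continue
        for j in range(n):
            if times[i] < times[j]:
                comparable += 1
                if risk[i] > risk[j]:
                    concordant += 1.0
                elif risk[i] == risk[j]:
                    concordant += 0.5  # tied predictions count as half
    if comparable == 0:
        raise ValueError("no comparable pairs in the data")
    return concordant / comparable


# Toy data: four records, the third one censored
toy_times = [2, 4, 6, 8]
toy_events = [1, 1, 0, 1]
toy_risk = [0.9, 0.7, 0.8, 0.1]
toy_c = harrell_c_index(toy_risk, toy_times, toy_events)
print("toy c-index = {:.2f}".format(toy_c))
```

A value of 0.5 corresponds to random ranking and 1.0 to perfect ranking, so the two models in the example can be compared directly on this scale.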

Citation and License

Citation

If you use PySurvival in your research, we would greatly appreciate it if you could cite it as follows:

@Misc{ pysurvival_cite,
  author =    {Stephane Fotso and others},
  title =     {PySurvival: Open source package for Survival Analysis modeling},
  year =      {2019--},
  url = "https://www.pysurvival.io/"
}

License

Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.

Owner

Square