Newer
Older
> The Poisson lognormal model and variants can be used for analysis of mutivariate count data.
> This package implements
> efficient algorithms extracting meaningful data from difficult to interpret
> and complex multivariate count data. It has been built to scale on large datasets even
> though it has memory limitations. Possible fields of applications include
> - Genomics (number of times a gene is expressed in a cell)
> - Ecology (number of individuals of some species in a specific site)
> One main functionality is to normalize the data to obtain more valueable data.
<!-- accompanied with a set of -->
<!-- > functions for visualization and diagnostic. See [this deck of -->
<!-- > slides](https://pln-team.github.io/slideshow/) for a -->
<!-- > comprehensive introduction. -->
## Getting started
The getting started can be found [here](https://forgemia.inra.fr/bbatardiere/pyplnmodels/-/raw/dev/Getting_started.ipynb?inline=false). If you need just a quick view of the package, see next.
[pypi](https://pypi.org/project/pyPLNmodels/). The development
version is available on [GitHub](https://github.com/PLN-team/pyPLNmodels).
```
pip install pyPLNmodels
```
The package comes with an ecological data set to present the functionality
from pyPLNmodels.models import PlnPCAcollection, Pln
from pyPLNmodels.oaks import load_oaks
oaks = load_oaks()
pln = Pln.from_formula("counts ~ 1 + tree + dist2ground + orientation ", data = oaks, take_log_offsets = True)
transformed_data = pln.transform()
### Rank Constrained Poisson lognormal for Poisson Principal Component Analysis (aka PLNPCA)
pca = PlnPCAcollection.from_formula("counts ~ 1 + tree + dist2ground + orientation ", data = oaks, take_log_offsets = True, ranks = [3,4,5])
transformed_data = pln.transform()
Please cite our work using the following references:
- J. Chiquet, M. Mariadassou and S. Robin: Variational inference for
probabilistic Poisson PCA, the Annals of Applied Statistics, 12:
2674–2698, 2018. [link](http://dx.doi.org/10.1214/18%2DAOAS1177)