slackbuilds_ponce/academic/hyphy/README
Petar Petrov 26da2395a3
academic/hyphy: Updated for version 2.5.50.
Signed-off-by: bedlam <dave@slackbuilds.org>

Signed-off-by: Willy Sudiarto Raharjo <willysr@slackbuilds.org>
2023-05-13 19:27:12 +07:00

61 lines
2.9 KiB
Text

HyPhy: Hypothesis testing using Phylogenies
HyPhy is an open-source software package for the analysis of genetic
sequences (in particular the inference of natural selection) using
techniques in phylogenetics, molecular evolution, and machine learning.
It features a rich scripting language for limitless customization of
analyses. Additionally, HyPhy features support for parallel computing
environments (via message passing interface).
HyPhy was designed to allow the specification and fitting of a broad
class of continuous-time discrete-space Markov models of sequence
evolution. To implement these models, HyPhy provides its own scripting
language - HBL, or HyPhy Batch Language, which can be used to develop
custom analyses or modify existing ones. Importantly, it is not
necessary to learn (or even be aware of) HBL in order to use HyPhy, as
most common models and analyses have been implemented for user
convenience. Once a model is defined, it can be fitted to data (using a
fixed topology tree), its parameters can be constrained in user-defined
ways to test various hypotheses (e.g. is rate1 > rate2), and simulate
data from. HyPhy primarily implements maximum likelihood methods, but
it can also be used to perform some forms of Bayesian inference (e.g.
FUBAR), fit Bayesian graphical models to data, run genetic algorithms to
perform complex model selection.
Features
- Support for arbitrary sequence data, including nucleotide, amino-acid,
codon, binary, count (microsattelite) data, including multiple
partitions mixing differen data types.
- Complex models of rate variation, including site-to-site, branch-to-
branch, hidden markov model (autocorrelated rates), between/within
partitions, and co-varion type models.
- Fast numerical fitting routines, supporting parallel and distributed
execution.
- A broad collection of pre-defined evolutionary models.
- The ability to specify flexible constraints on model parameters and
estimate confidence intervals on MLEs.
- Ancestral sequence reconstruction and sampling.
- Simulate data from any model that can be defined and fitted in the
language.
- Apply unique (for this domain) machine learning methods to discover
patterns in the data, e.g. genetic algorithms, stochastic context free
grammars, Bayesian graphical models.
- Script analyses completely in HBL including flow control, I/O,
parallelization, etc.
Registration
you are highly advised to fill the registration form found at:
https://veg.github.io/hyphy-site/register/
NOTE!
The script builds two executables: HYPHYMP, which uses pthreads to do
multiprocessing and HYPHYMPI, which uses openmpi (hence the dependency).
HyPhy Vision
To visualise your results produced offline, you can use HyPhy Vision:
http://vision.hyphy.org/
Citing
Sergei L. Kosakovsky Pond, Simon D. W. Frost and Spencer V. Muse (2005)
HyPhy: hypothesis testing using phylogenies.
Bioinformatics 21(5): 676-679