29.05.2021 Views

Phylogenetic analysis based on Machine Learning Algorithm

The interpretation of the phylogenetic tree is an essential yet challenging aspect of evolutionary studies. To conduct an evolutionary study of the organisms is the core of biological research. The resulting phylogeny is then subjected to a plethora of analyses essential for further genomic research (Azouri 2021). The phylogenetic analysis involves several methods that can be used to interpret data. Recently, researchers have begun studying the use of machine learning in inferring phylogenetic trees. Contact:

The interpretation of the phylogenetic tree is an essential yet challenging aspect of evolutionary studies. To conduct an evolutionary study of the organisms is the core of biological research. The resulting phylogeny is then subjected to a plethora of analyses essential for further genomic research (Azouri 2021). The phylogenetic analysis involves several methods that can be used to interpret data. Recently, researchers have begun studying the use of machine learning in inferring phylogenetic trees.

Contact:

SHOW MORE
SHOW LESS
  • No tags were found...

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

PHYLOGENETIC ANALYSIS

USING MACHINE LEARNING

An Academic presentation by

Dr. Nancy Agnes, Head, Technical Operations, Tutors India

Group www.tutorsindia.com

Email: info@tutorsindia.com


Today's Discussion

OUTLINE

Introduction

Phylogenetic Analysis

Currently available methods for inference

Application of machine learning

Future scope


INTRODUCTION

The interpretation of the phylogenetic tree is an

essential yet challenging aspect of evolutionary studies.

To conduct an evolutionary study of the organisms is the

core of biological research.

The resulting phylogeny is then subjected to a plethora of

analyses essential for further genomic research (Azouri

2021).

The phylogenetic analysis involves several methods

that can be used to interpret data. Recently, researchers

have begun studying the use of machine learning in

inferring phylogenetic trees.

Contd...


PHYLOGENETIC

ANALYSIS

The study of the evolutionary history of a species or a

group of organisms is known as phylogenetic analysis.

Here, the evolutionary relationship between different

species or organisms having a common ancestor is

represented with the help of branching diagrams.

This diagram is called the phylogenetic tree, which can be

either rooted or unrooted.

Phylogenetic analysis can also be used to study the

relationship between characteristics of an organism,

including genes and proteins.

Contd...


The applications of phylogenetic analysis are numerous.

These include – reconstruction of the ancestral gene for the derivation of extant

genes, study of human disease and epidemiology, interpretation of the evolution of

ecological and behavioural traits, estimation of historical biogeographic

relationships, and many more.

Interesting Blog: Performance Evaluation Metrics for Machine-Learning Based

Dissertation


CURRENTLY

AVAILABLE

METHODS FOR

INFERENCE

Previously, morphological features were used in the

assessment of similarities among species and in

phylogenetic analysis.

It has drastically changed over time. Nowadays, this

analysis uses information extracted from DNA, RNA or

protein.

The generation of a phylogenetic tree involves the

alignment of sequences.

The most widely-used tool for this is the alignment-based

methodology.

Contd...


In this method, the two sequences are stacked in a way to highlight their common

symbols and substrings.

This comparison of sequences helps to identify patterns of shared ancestry between

species.

(Munjal 2019). However, exploiting these large-scale molecular data poses

significant challenges.

One of the most difficult tasks is to develop effective techniques for the extraction of

missing data.

Contd...


The Maximum likelihood or Markov Chain Monte Carlo (MCMC) methods and

probabilistic models of sequence evolution are highly reliable statistical methods used

for the reconstruction of gene and species trees.

Even so, many of these approaches are not scalable enough to study phylogenomic

datasets of hundreds or thousands of genes and taxa.

Thus, the development of a quick and efficient method is the need of the hour (

Bhattacharjee 2020).



APPLICATION OF

MACHINE

LEARNING

Machine learning has found various applications in the

field of technology-driven research.

One such usage of machine learning is in the

inference of the phylogenetic tree.

In a recent study, researchers utilized the machine

learning method to predict the best model for the most

common prediction task: phylogenetic tree

reconstruction for a given collection of sequences

(Abadi 2020).

Contd...


A research study gave a detailed analysis of plant diversity trends to date,

demonstrating that using machine learning to forecast future diversity could be

tremendously beneficial.

They applied machine learning approaches to phylogenetic diversity in vascular plants

(Park 2020). Bhattacharjee et al.,

for the very first time, demonstrated the potential and feasibility of using deep learning

techniques to compute distance matrices.

The study evaluated both matrix factorization (ME) and autoencoder (AE) and aimed to

develop improvised models for better results.

Contd...


They showed that both these methods are reliable and can be applied for handling

large-scale datasets.

They also highlighted the ability of these techniques over the heuristic-based

techniques to automatically learn complicated inter-variable associations.

Their research can also be used as a model for applying machine learning methods to

the phylogenetic analysis (Bhattacharjee 2020).

In another research, a machine learning framework was developed to rank the

neighbouring trees in accordance with their prosperity to increase the likelihood.

Contd...


They applied multiple features and utilized machine learning to improve an optimal

tool. The study suggested specific ways to practice machine learning algorithms in

phylogenetic analysis.

Furthermore, they presented a methodology that can significantly speed up treesearch

algorithms without sacrificing accuracy(Azouri 2021).

A recent review focused on the application of machine learning-based techniques in

the data analysis of the human microbiome.

It provided an insight into the plethora of advantages that machine learning has to

offer over classical methods.

Contd...


The most common techniques covered in this review involved Support Vector

Machines, Random Forest, k-NN and Logistic Regression.

This review suggested how machine learning can contribute to the development of

new models that can be useful in predicting classifications in the field of microbiology,

inferring host phenotypes to predict diseases and characterization of state-specific

microbial signatures using microbial communities(Macros 2021).

Contd...


Contd...


FUTURE SCOPE

Machine learning has found various applications in the

field of technology-driven research.

One such usage of machine learning is in the

inference of the phylogenetic tree.

In a recent study, researchers utilized the machine

learning method to predict the best model for the most

common prediction task: phylogenetic tree

reconstruction for a given collection of sequences

(Abadi 2020).Future scope

Contd...


Contd...


CONTACT US

UNITED KINGDOM

+44-1143520021

INDIA

+91-4448137070

EMAIL

info@tutorsindia.com

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!