https://github.com/dmulder/phylogen

As part of a graduate project I created the phylogen script, which combines the ASTRAL and IQTREE analysis tools into one iterative approach to finding the highest bootstrap score tree for various species of Mayflies. At the completion of the project, there was some work left undone, and I had a few ideas on how to improve the average bootstrap score of the final tree. For this hackweek, I'll complete the phylogen script by; first, calling the ASTRAL jar file in python using jpype, and second, creating python bindings to the IQTREE c code. Once these are complete, I hope that the script will run a bit faster, enabling me to run additional analysis on the mayfly genetic encoding data, and revealing a better tree.

Looking for hackers with the skills:

Nothing? Add some keywords!

This project is part of:

Hack Week 18

Activity

  • almost 5 years ago: dmaiocchi liked this project.
  • almost 5 years ago: dmulder started this project.
  • almost 5 years ago: dmulder originated this project.

  • Comments

    • dmulder
      almost 5 years ago by dmulder | Reply

      I've completed the jpype work, and now I'm working on the iqtree python bindings.

    • dmulder
      almost 5 years ago by dmulder | Reply

      Today I finished the iqtree python bindings. It currently works on a single node with multiple cores. Next I need to enable multi-node processing with MPI (it already has some MPI code, but needs reworked to use the latest changes).

    • dmulder
      almost 5 years ago by dmulder | Reply

      Enabled multi-node using mpi4py MPIPoolExecutor. This works effectively the same as multiprocessing.Pool (which I was already using for single node processing). Need to add some intelligence around picking the best exon lengths, otherwise the cluster exhausts all available memory running multiple instances of ASTRAL!

    Similar Projects

    This project is one of its kind!