Question about HHpred parser

22 Apr 2020

      Dear CCTBX developers,

I am a postdoc at EPFL working with HHpred for homology modeling of membrane proteins.

I have been trying to write my own HHpred alignment parser until I found the python script under “cctbx_fork/iotbx/bioinformatics/__init__.py/” that contains an HHpred parser.

My goal is to correctly parse the raw HHpred output file (.hhr), which involves unwrapping every alignment, parsing out a lot of text to finally obtain something like this:
...
pdb_name
query-sequence

column score

Example:
...
4U15
VYGFIGGIFGFMSIMTMAMISIDRYNVIGRPMAASKKMSHRRAFIMIIFVWLWS

+........+..+..++|+++|++++.++.+.++++ +..+.++.+|+++|++.++...+........     +...|..

Being somewhat new to python, I was wondering whether the people who wrote this script are still around and could help me figure out whether the parser could be implemented in such a way.

Thanks for any help you can provide!

Best,

Louis D

Louis Dumas

Gabor Bunkoczi

tags

participants (2)