Many highly-expressed COGs were annotated as transport proteins. We used the metabolic network reconstructions for the acI genomes to systematically characterize the transport capabilities of the acI lineage. For each genome, we identified all transport reactions present in its metabolic network reconstruction. Gene-protein-reaction associations (GPRs) for these reactions were manually curated to remove unannotated proteins, group genes into operons (if applicable), and to identify missing subunits for multi-subunit transporters. These genes were then mapped to their corresponding COGs, and GPRs were grouped on the basis of their mapped COGs. Finally, consensus annotations within each clade were used to identify likely substrates for each of these groups.
The script code/annotTransporters/findGenesAnnotExpr.py
identifies all transport reactions, their GPRs and COGs, and the expression and annotation of each COG. The results are stored in data/transporters/cladeCOGs/clade.COG.norm
, and were manually curated to give the transporters in Table S13.
In [ ]: