Chongqing, China

brief description of this work

How to use online analysis tools

This online analysis model is based on the feature representation of lysine sites extracted from ESM2, which has been used to construct a cVAE_ResDNN model for ubi-prediction. Users can submit protein sequences to this online prediction tool, and can get the ubi-prediction outcomes.



1. Using the Online Upload Analyze Service (for ubi-prediction)


Step 1: Users need to log in to "https://eup.aibtit.com/upload" to use this service.


Step 2: Users need to provide the protein sequence in FASTA format for testing (for example data, please see the "Download Example Data" on this page).


Step 3: After submitting, users will receive an extraction code and can check the processing status of their test file.


Step 4: Once the processing is complete with the extraction code, users can download the results in an Excel spreadsheet and visualization images of ubiquitination sites.



2. Using the SHAP Visualization Service (for model interpretation)


Step 1: Users need to log in to "https://eup.aibtit.com/shap_upload" to use this service.


Step 2: Users need to upload the result Excel file from the Online Upload Analyze service (containing 2560 features) in the required format (for example data, please see the "Download Example Data" on this page).


Step 3: After submitting, users will receive an extraction code and can check the processing status of their test file.


Step 4: Once the processing is complete with the extraction code, users can download the SHAP visualization images for feature interpretation and further analysis.






About other works

Code and References Related to Its Work

If you are interested in protein subcellular localization prediction and model interpretability, you can refer to our previous work as follows:


Zeyu Luo, Rui Wang, Yawen Sun, Junhao Liu, Zongqing Chen, Yu-Juan Zhang, "Interpretable feature extraction and dimensionality reduction in ESM2 for protein localization prediction," published in Briefings in Bioinformatics, Volume 25, Issue 2, March 2024, article bbad534. The DOI for this publication is https://doi.org/10.1093/bib/bbad534.


We have also published many details related to this work on our GitHub repository, which you can access at https://github.com/yujuan-zhang/feature-representation-for-LLMs.