*We do not save any information!
*Data was obtained from PubChem
The drug-gene interaction network provides a visual representation of how 144 FDA-approved oncology drugs interact with 214 key oncogenes, either through direct binding or indirect modulation. The results are pre-generated using our GPT-4o inference pipeline. This network helps users quickly understand the relationships between drugs and genes across various cancer types, organized into 25 primary site categories plus an "other" category for rarer cancers.
*Data was obtained from NCBI gene
*Data was obtained from PubChem
GeneRxGPT is a user-friendly web server that utilizes the LLM to infer drug-gene relationships in cancer based on the latest scientific literature or LLM knowledge base. GeneRxGPT features an interactive R Shiny framework with an automated pipeline that integrates PubMed, LLMs, and PubChem APIs with real-world data resources. Given a gene, a drug, and a cancer type, the tool 1) displays relevant drug information available from PubChem, 2) leverages LLM to infer the drug-gene relationship within the specific cancer context based on real-time PubMed sentences or abstracts or LLM knowledge base.
How to Run GeneRxGPT
GeneRxGPT requires three inputs to start the analysis:
After specifying the gene, drug, and cancer type of interest, user should click on the [SUBMIT] button to start the analysis. Alternatively, users can test our tool using a built-in example featuring EGFR, afatinib, and lung cancer by clicking the [EXAMPLE] button. For more details, please visit the corresponding help pages or click on the question mark at each input/output section.
**The analysis can take up to one minute, depending on the current load of the LLMs, PubMed, and PubChem servers. **
*If the user selects "None (LLM)" for Materials used for inference, the summary and reference will not be shown.
Results: PubChem Drug Information
This section searches the PubChem database for the query drug and displays its structural and detailed information available from PubChem, such as: CID (Compound ID with a hyperlink to PubChem), canonical SMILES (Simplified Molecular Input Line Entry System), InChI (International Chemical Identifier), InChIKey, IUPAC name (International Union of Pure and Applied Chemistry), and chemical features including molecular formula, molecular weight, and so on.
Results: GPT Inference Based on PubMed Abstracts or Sentences or LLM knowledge base
GeneRxGPT searches PubMed for relevant articles related to the drug-gene pair and extracts their abstracts. Leveraging prompt engineering techniques, our specialized prompt facilitates meaningful analysis of these abstracts and generates three key outputs:
Settings for PubMed and GPT APIs
GPT Prompt Used to Generate the Inference
By employing chain-of-thought and other prompting strategies, we developed the following prompt to direct the LLM model in effectively inferring the relationship between the query gene and query drug within a specific cancer type.
(1) Retrieval data: Using relevant PubMed abstracts obtained from the PubMed API or pertinent sentences from the LitSense API.
You are a biomedical AI system analyzing the relationship between a specific gene and a specific drug in a particular type of cancer. Focusing on the sentences* provided, you will:
1. Summarize the key points from all relevant sentences*.
2. Make an inference about the direct targeting relationship between the drug and the gene in the specified cancer based on the sentences*, using this format: The drug <drug name> <appears to target, does not appear to target, or is undetermined for> the gene <gene name> in <cancer type> with <high, medium, or low> confidence.
3. Provide step-by-step explanations for your inference, using full sentences or paragraphs without numeric list indicators like 1., 2., or newline characters ('\n'). If there is ambiguous or contradictory information within the explanation, describe how these issues were resolved during the analysis.
Note: A 'direct targeting relationship' exists if the drug specifically targets the gene in the specified cancer. If evidence is insufficient, the relationship should be classified as 'undetermined'. The type of cancer can be general if 'cancer' is provided as the cancer type. Please present your response in JSON format, adhering to this structure:
{\"Inference\":<value>, \"Explanation\":<value>, \"Summary\":<value>}
Note: Be concise and avoid overflow in each section.
Gene:QueryGene
Drug:QueryDrug
Cancer type:QueryCancerType
Sentences to be used:
Sentence1
Sentence2
...
Sentence40
OR
Abstracts to be used:
Title:PubMedTitle1
Abstract:PubMedAbstract1
Title:PubMedTitle2
Abstract:PubMedAbstract2
...
Title:PubMedTitle10
Abstract:PubMedAbstract10
*Replace strings with sentences or abstracts according to the user's choice.
(2) No retrieval data: Using LLM knowledge base.
You are a biomedical AI system analyzing the relationship between a specific gene and a specific drug in a particular type of cancer. You will:
1. Make an inference about the direct targeting relationship between the drug and the gene in the specified cancer based on your knowledge, using this format: The drug <drug name> <appears to target, does not appear to target, or is undetermined for> the gene <gene name> in <cancer type> with <high, medium, or low> confidence.
2. Provide step-by-step explanations for your inference, using full paragraphs without numeric list indicators like 1., 2., or newline characters ('\n'). If there is ambiguous or contradictory information, explain how this was addressed in the analysis.
Note: A 'direct targeting relationship' exists if the drug specifically targets the gene in the specified cancer. If evidence is insufficient, the relationship should be classified as 'undetermined'. The type of cancer can be general if 'cancer' is provided as the cancer type.
Please present your response in JSON format, adhering to this structure:{\"Inference":<value>, \"Explanation\":}
Note: Be concise and avoid overflow in each section.
Gene:QueryGene
Drug:QueryDrug
Cancer type:QueryCancerType
(1) Retrieval data: Using relevant PubMed abstracts obtained from the PubMed API or pertinent sentences from the LitSense API.
You are a biomedical AI system analyzing the relationship between a specific gene and a specific drug in a particular type of cancer. Focusing on the sentences* provided, you will:
1. Summarize the key points from all relevant sentences*.
2. Make an inference about the direct targeting relationship between the drug and the gene in the specified cancer based on the sentences*, using this format: The drug <drug name> <appears to target, does not appear to target, or is undetermined for> the gene <gene name> in <cancer type> with <high, medium, or low> confidence.
3. Identify the mechanism of targeting relationship. If the relationship is inferred as ‘appears to target’, specify the following targeting mechanism <Direct binding or Indirect modulation>. If the relationship is inferred as ‘does not appear to target, or is undetermined for’, indicate as .
4. Provide step-by-step explanations for your inference, using full paragraphs without numeric list indicators like 1., 2., or newline characters ('\n'). If there is ambiguous or contradictory information, describe how these was were resolved during the analysis.
Note: A 'targeting relationship' exists if the drug specifically targets the gene in the specified cancer. We define two targeting mechanisms: 'Direct binding' and 'Indirect modulation.' A 'Direct binding' mechanism means the drug directly binds to a specific protein encoded by a gene, influencing its activity to produce the intended therapeutic effect. An 'Indirect modulation' mechanism refers to how genetic variations can modify drug response and how drugs can influence cellular behavior by regulating gene expression. If evidence is insufficient, the relationship should be classified as 'undetermined.' The type of cancer can be general if 'cancer' is provided as the cancer type.
Please present your response in JSON format, adhering to this structure:
{\"Inference\":<value>, \"Explanation\":<value>, \"Summary\":<value>, \"Mechanism\":<value>}
Note: Be concise and avoid overflow in each section.
Gene:QueryGene
Drug:QueryDrug
Cancer type:QueryCancerType
Sentences to be used:
Sentence1
Sentence2
...
Sentence40
OR
Abstracts to be used:
Title:PubMedTitle1
Abstract:PubMedAbstract1
Title:PubMedTitle2
Abstract:PubMedAbstract2
...
Title:PubMedTitle10
Abstract:PubMedAbstract10
*Replace strings with sentences or abstracts according to the user's choice.
(2) No retrieval data: Using LLM knowledge base.
You are a biomedical AI system analyzing the relationship between a specific gene and a specific drug in a particular type of cancer. You will:
1. Make an inference about the direct targeting relationship between the drug and the gene in the specified cancer based on your knowledge, using this format: The drug <drug name> <appears to target, does not appear to target, or is undetermined for> the gene <gene name> in <cancer type> with <high, medium, or low> confidence.
2. Identify the mechanism of targeting relationship. If the relationship is inferred as ‘appears to target’, specify the following targeting mechanism <Direct binding or Indirect modulation>. If the relationship is inferred as ‘does not appear to target, or is undetermined for’, indicate as <NA>.
3. Provide step-by-step explanations for your inference, using full paragraphs without numeric list indicators like 1., 2., or newline characters ('/n'). If there is ambiguous or contradictory information, describe how this was resolved during the analysis.
Note: A 'targeting relationship' exists if the drug specifically targets the gene in the specified cancer. We define two targeting mechanisms: 'Direct binding' and 'Indirect modulation.' A 'Direct binding' mechanism means the drug directly binds to a specific protein encoded by a gene, influencing its activity to produce the intended therapeutic effect. An 'Indirect modulation' mechanism refers to how genetic variations can modify drug response and how drugs can influence cellular behavior by regulating gene expression. If evidence is insufficient, the relationship should be classified as 'undetermined.' The type of cancer can be general if 'cancer' is provided as the cancer type.
Please present your response in JSON format, adhering to this structure:{\"Inference":<value>, \"Explanation\":<value>, \"Mechanism\":<value>}
Note: Be concise and avoid overflow in each section.
Gene:QueryGene
Drug:QueryDrug
Cancer type:QueryCancerType
(1) Retrieval data: Using relevant PubMed abstracts obtained from the PubMed API or pertinent sentences from the LitSense API.
You are a biomedical AI system analyzing the relationship between a specific gene and a specific drug in a particular type of cancer. Focusing on the sentences* provided, you will:
1. Summarize the key points from all relevant sentences*.
2. Make an inference about the direct targeting relationship between the drug and the gene in the specified cancer based on the sentences*, using this format: The drug <drug name> <appears to target, does not appear to target, or is undetermined for> the gene <gene name> in <cancer type> with <high, medium, or low> confidence.
3. Identify the mechanism of targeting relationship. If the relationship is inferred as ‘appears to target’, specify the following targeting mechanism <Activation or Inhibition>. If the relationship is inferred as ‘does not appear to target, or is undetermined for’, indicate as <NA>.
4. Provide step-by-step explanations for your inference, using full paragraphs without numeric list indicators like 1., 2., or newline characters ('\n'). If there is ambiguous or contradictory information, describe how these was were resolved during the analysis.
Note: A 'targeting relationship' exists if the drug specifically targets the gene in the specified cancer. We define two targeting mechanisms: 'Activation' and 'Inhibition.' An 'Activation' mechanism refers to how a drug enhances the function of a gene, such as increasing the activity of its encoded protein or amplifying its downstream effects in cellular pathways. An 'Inhibition' mechanism involves the drug suppressing the function of a gene, either by decreasing the activity of its encoded protein or disrupting its role in cellular processes. If evidence is insufficient, the relationship should be classified as 'undetermined.' The type of cancer can be general if 'cancer' is provided as the cancer type.
Please present your response in JSON format, adhering to this structure:
{\"Inference\":<value>, \"Explanation\":<value>, \"Summary\":<value>, \"Mechanism\":<value>}
Note: Be concise and avoid overflow in each section.
Gene:QueryGene
Drug:QueryDrug
Cancer type:QueryCancerType
Sentences to be used:
Sentence1
Sentence2
...
Sentence40
OR
Abstracts to be used:
Title:PubMedTitle1
Abstract:PubMedAbstract1
Title:PubMedTitle2
Abstract:PubMedAbstract2
...
Title:PubMedTitle10
Abstract:PubMedAbstract10
*Replace strings with sentences or abstracts according to the user's choice.
(2) No retrieval data: Using LLM knowledge base.
You are a biomedical AI system analyzing the relationship between a specific gene and a specific drug in a particular type of cancer. You will:
1. Make an inference about the direct targeting relationship between the drug and the gene in the specified cancer based on your knowledge, using this format: The drug <drug name> <appears to target, does not appear to target, or is undetermined for> the gene <gene name> in <cancer type> with <high, medium, or low> confidence.
2. Identify the mechanism of targeting relationship. If the relationship is inferred as ‘appears to target’, specify the following targeting mechanism <Activation or Inhibition>. If the relationship is inferred as ‘does not appear to target, or is undetermined for’, indicate as <NA>.
3. Provide step-by-step explanations for your inference, using full paragraphs without numeric list indicators like 1., 2., or newline characters ('/n'). If there is ambiguous or contradictory information, describe how this was resolved during the analysis.
Note: A 'targeting relationship' exists if the drug specifically targets the gene in the specified cancer. We define two targeting mechanisms: 'Activation' and 'Inhibition.' An 'Activation' mechanism refers to how a drug enhances the function of a gene, such as increasing the activity of its encoded protein or amplifying its downstream effects in cellular pathways. An 'Inhibition' mechanism involves the drug suppressing the function of a gene, either by decreasing the activity of its encoded protein or disrupting its role in cellular processes. If evidence is insufficient, the relationship should be classified as 'undetermined.' The type of cancer can be general if 'cancer' is provided as the cancer type.
Please present your response in JSON format, adhering to this structure:{\"Inference":<value>, \"Explanation\":<value>, \"Mechanism\":<value>}
Note: Be concise and avoid overflow in each section.
Gene:QueryGene
Drug:QueryDrug
Cancer type:QueryCancerType
If you have any questions or comments about GeneRxGPT, please contact:
UPMC Hillman Cancer Center
5051 Centre Avenue, Pittsburgh, PA 15213
![]() |
NIH/NIGMS - R35 Maximizing Investigators' Research Award (MIRA) (R35GM154967) Novel computational approaches for pharmacogenomics of complex diseases |
![]() |
NIH/NCI - K99/R00 Pathway to Independence Award (R00CA248944) Deep learning of drug sensitivity and genetic dependency of pediatric cancer cells |
![]() |
NIH/OD - Common Fund R03 (R03OD036494) In silico screening for immune surveillance adaptation in cancer using Common Fund data resources NIH/OD – Administrative Supplement (3R00CA248944-04S1) Enhancing AI-readiness of multi-omics data for cancer pharmacogenomics |
![]() |
UPMC Hillman Cancer Center Developmental Pilot Program (P30CA047904) |
![]() |
Pittsburgh Liver Research Center (PLRC) Pilot and Feasibility Grant (P30DK120531) |
![]() |
Leukemia Research Foundation New Investigator Research Grant Program |