🎓 About Me

I am an undergraduate researcher at Jiangnan University (Project 211 & Double First-Class). My work centres on AI for Science—especially biological sequence modelling, single-cell foundation models, protein/peptide design, and large-language-model reasoning.

I have authored or co-authored 7 papers (6 as first / co-first author, 6 accepted), co-spanning NeurIPS, ICML, PRCV, and ICIC, filed 1 invention patent, and won Gold at iGEM 2025 as the dry-lab lead.

I am actively seeking research opportunities (RA, summer internship, or graduate study) in AI for Life Sciences. Please feel free to reach out.

🌟 Research Vision

Biology speaks in sequences; AI is learning to listen. I am driven by two complementary pursuits:

  • Decoding biological sequences. Designing information-theoretic, interpretable foundation models for proteins, peptides, single-cell transcriptomes and beyond.
  • Reasoning machines for science. Building LLM-driven agents that fuse symbolic reasoning with deep representation learning to solve concrete scientific design problems.

"We must know — we will know." — David Hilbert

📰 News

  • 2026 RA-Det (5th author) accepted to ICML 2026 (CCF-A). NEW
  • 2026 Tokenization is Mechanism submitted to NeurIPS 2026 (CCF-A) as first author.
  • 2026 Four first-author papers accepted as Orals at ICIC 2026: ProtoGene · Extract-Then-Compile · Alignment-Adaptive Fusion · FWMamba-UNet.
  • 2025-11 iGEM 2025 Gold Medal awarded in Paris; presented as dry-lab lead.
  • 2025 MambaGuard (co-first author) accepted to PRCV 2025 (CCF-C).
  • 2025 Invention patent filed: "A Neuro-Symbolic Method & Apparatus for Language-Driven Travel Planning."
  • 2025 Awarded Outstanding Student Cadre of Wuxi City.

🏫 Education

Jiangnan University  (Project 211 · Double First-Class)

B.Eng. in Digital Media Technology, School of AI & Computer Science

Sep 2023 — Jun 2027  |  GPA: 88 / 100

Honours: Jiangnan University Honor Student (至善生), 1st-class Comprehensive Scholarship, Outstanding Student Cadre (校级), Outstanding Student Cadre of Wuxi City (市级).

📄 Publications

1 denotes first author · * denotes co-first author

1
Tokenization is Mechanism: Information-Asymmetric Token Merging for Biological Sequences
Zihao Guo1, et al.
NeurIPS 2026 · CCF-A Under Review First Author

pMHC binding prediction · information-theoretic token merging · interpretable mechanism · cross-domain biological sequence modelling (pMHC / TCR / DNA / SMILES).

2
ProtoGene: Bridging the RT-Gap in Single-Cell Foundation Models via Biology-Aware Prototypical Fine-Tuning
Zihao Guo1, et al.
ICIC 2026 · CCF-C · Oral Accepted First Author

Single-cell foundation model · prototypical contrastive fine-tuning · generalisation across sequencing protocols.

3
Extract Then Compile: Reliable Neuro-Symbolic Planning for Large Language Models
Zihao Guo1, et al.
ICIC 2026 · CCF-C · Oral Accepted First Author

LLM agent · neuro-symbolic reasoning · constraint-satisfaction planning & solving.

4
Fusion Direction Matters: Alignment-Adaptive Cross-Modal Fusion for Medical Image Segmentation
Zihao Guo1, et al.
ICIC 2026 · CCF-C · Oral Accepted First Author

Vision-language model · multimodal fusion · medical image segmentation.

5
FWMamba-UNet: Frequency-Wavelet Enhanced Mamba UNet for Medical Image Segmentation
Zihao Guo1, et al.
ICIC 2026 · CCF-C · Oral Accepted First Author

Mamba / state-space model · wavelet transform · medical image segmentation.

6
MambaGuard: A CLIP-Mamba Approach for OOD Generated Image Detection
Xinchang Wang*, Yuechen Zhang*, Zihao Guo*, et al.
PRCV 2025 · CCF-C Accepted Co-First Author

AI-generated image detection · vision-language model · Mamba.

7
RA-Det: Towards Universal Detection of AI-Generated Images via Robustness Asymmetry
Xinchang Wang, Yunhao Chen, Yuechen Zhang, Congcong Bian, Zihao Guo, et al.
ICML 2026 · CCF-A Accepted 5th Author

Universal detection of AI-generated images via robustness-asymmetry.

Patent — A Neuro-Symbolic Method & Apparatus for Language-Driven Travel Planning
Chinese Invention Patent (Filed)
Filed

🧪 Featured Projects

First Author · ICIC 2026 Oral

FWMamba-UNet — Frequency-Wavelet Enhanced Mamba UNet

A medical image segmentation network that augments the Mamba state-space backbone with frequency-domain and wavelet-transform branches. The frequency-wavelet enhancement captures cross-scale boundary cues that pure spatial-domain UNets miss, yielding consistent Dice / HD95 gains across multi-organ and pathology datasets while keeping the linear-time complexity of Mamba.

Mamba / SSMWaveletFFTUNet Medical ImagingPyTorch
iGEM 2025 · Gold Medal · Dry-Lab Lead

AMP Forge — Antimicrobial Peptide De-Novo Design

A general AMP design platform tackling antibiotic resistance via the human peptide LL-37 expressed in S. cerevisiae. Built the largest curated AMP corpus and a three-stage pipeline: ESM-2 / ProtT5 / Ankh + BiGRU-VAE latent space → Latent Diffusion → Transformer decoder, trained MLE → RL adversarial → diffusion fine-tuning. Supports six generation modes and achieves SOTA on multiple metrics; generated variants outperformed the wild type in wet-lab assays.

PyTorchESM-2ProtT5Ankh Latent DiffusionBiGRU-VAERLHF
First Author · ICIC 2026 Oral

ProtoGene — Single-Cell Foundation Model Fine-Tuning

A biology-aware prototypical fine-tuning framework that closes the read-technology gap between scRNA-seq protocols, improving the generalisation of pretrained single-cell foundation models on unseen datasets.

scGPTGeneformerContrastivePyTorch
First Author · NeurIPS 2026 (Under Review)

Tokenization-as-Mechanism

Information-asymmetric token merging that turns the tokenizer itself into an interpretable mechanism for biological sequence learning. Tested across pMHC, TCR, DNA, and SMILES with consistent gains and interpretable merging trees.

Information TheorypMHCTCRDNASMILES
First Author · ICIC 2026 Oral

SHINE — Neuro-Symbolic Travel Planner

Extract-Then-Compile: an LLM agent that lifts natural-language constraints into a symbolic program, then solves it reliably via classical search. Achieves a strong pass-rate on the ChinaTravel benchmark and ships as a filed invention patent.

LLM AgentNeuro-SymbolicConstraint Solving

📬 Contact

  1191230418@jiangnan.edu.cn

  +86 176-1462-8870

  Wuxi, Jiangsu, China

I am open to RA / internship / graduate-study collaborations in AI for Life Sciences.