Michele Papucci

mpapucci

·

https://michelepapucci.github.io

AI & ML interests

NLP, Controlled Text Generation, Interpretability and Explainability, Text Simplification, Hallucination Detection

Recent Activity

authored a paper 8 days ago

What Intermediate Layers Know: Detecting Jailbreaks from Entropy Dynamics

upvoted a paper 10 days ago

What Intermediate Layers Know: Detecting Jailbreaks from Entropy Dynamics

submitted a paper 10 days ago

What Intermediate Layers Know: Detecting Jailbreaks from Entropy Dynamics

View all activity

Organizations

None yet

authored a paper 8 days ago

What Intermediate Layers Know: Detecting Jailbreaks from Entropy Dynamics

Paper • 2606.25182 • Published 12 days ago • 5

submitted a paper to Daily Papers 10 days ago

What Intermediate Layers Know: Detecting Jailbreaks from Entropy Dynamics

Paper • 2606.25182 • Published 12 days ago • 5

authored a paper about 1 year ago

Stress-testing Machine Generated Text Detection: Shifting Language Models Writing Style to Fool Detectors

Paper • 2505.24523 • Published May 30, 2025 • 10