DynamicVLA: A Vision-Language-Action Model for Dynamic Object Manipulation Paper • 2601.22153 • Published Jan 29 • 75
VISTA-Bench: Do Vision-Language Models Really Understand Visualized Text as Well as Pure Text? Paper • 2602.04802 • Published Feb 4 • 2
SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture Paper • 2605.12500 • Published 4 days ago • 168
SenseNova-U1 Collection SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-Unify Architecture • 8 items • Updated about 15 hours ago • 62
Reply: Model available on HF? 👀 — We just open-sourced it here: https://github.com/OpenSenseNova/SenseNova-U1. Give it a try!
Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond Paper • 2604.22748 • Published 22 days ago • 226