Yasunori Ozaki's picture

In a Training Loop 🔄

Yasunori Ozaki PRO

alfredplpl

·

https://alfredplpl.github.io/en/index.html

AI & ML interests

Computer Vision, LLM

Recent Activity

liked a model 6 days ago

HauhauCS/Qwen3.6-35B-A3B-Uncensored-HauhauCS-Aggressive

upvoted a paper 6 days ago

MONET: A Massive, Open, Non-redundant and Enriched Text-to-image dataset

upvoted a collection 6 days ago

MONET - Massive Open Non-redundant, Enriched, Text-to-image

View all activity

Organizations

upvoted a paper 6 days ago

MONET: A Massive, Open, Non-redundant and Enriched Text-to-image dataset

Paper • 2605.21272 • Published 16 days ago • 3

upvoted a collection 6 days ago

MONET - Massive Open Non-redundant, Enriched, Text-to-image

A curated, deduped & recaptioned open image–text dataset of 104.9M samples released under the Apache2.0 licence. https://huggingface.co/blog/jasperai/ • 4 items • Updated 7 days ago • 10

upvoted a collection 9 days ago

Bonsai Image

6 items • Updated about 5 hours ago • 82

upvoted a collection 11 days ago

Jagle

Jagle: Building a Large-Scale Japanese Multimodal Post-Training Dataset for Vision–Language Models • 5 items • Updated Apr 12 • 2

upvoted a collection 13 days ago

MobileCLIP2

MobileCLIP2: Mobile-friendly image-text models with SOTA zero-shot capabilities trained on DFNDR-2B • 30 items • Updated Apr 23 • 62

upvoted a paper 13 days ago

L2P: Unlocking Latent Potential for Pixel Generation

Paper • 2605.12013 • Published 24 days ago • 36

upvoted a paper 20 days ago

Asymmetric Flow Models

Paper • 2605.12964 • Published 23 days ago • 22

upvoted a paper 22 days ago

Qwen-Image-VAE-2.0 Technical Report

Paper • 2605.13565 • Published 23 days ago • 60

upvoted a paper 23 days ago

Qwen-Image-2.0 Technical Report

Paper • 2605.10730 • Published 25 days ago • 110

upvoted a paper 24 days ago

STARFlow2: Bridging Language Models and Normalizing Flows for Unified Multimodal Generation

Paper • 2605.08029 • Published 28 days ago • 12

upvoted 2 papers 27 days ago

Continuous-Time Distribution Matching for Few-Step Diffusion Distillation

Paper • 2605.06376 • Published 29 days ago • 26

Continuous Latent Diffusion Language Model

Paper • 2605.06548 • Published 29 days ago • 80

upvoted a collection 29 days ago

SenseNova-U1

SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-Unify Architecture • 9 items • Updated 8 days ago • 69

upvoted 2 collections about 1 month ago

GenLIP

Model weights of paper "Let ViT Speak: Generative Language-Image Pre-training" • 6 items • Updated about 1 month ago • 6

imabari-dialect-models

今治弁モデル • 6 items • Updated Apr 23 • 2

upvoted a paper about 1 month ago

World-R1: Reinforcing 3D Constraints for Text-to-Video Generation

Paper • 2604.24764 • Published Apr 27 • 118

upvoted a collection about 1 month ago

MiMo-V2.5

4 items • Updated Apr 27 • 88

upvoted a paper about 1 month ago

AVControl: Efficient Framework for Training Audio-Visual Controls

Paper • 2603.24793 • Published Mar 25 • 28

upvoted 2 collections about 1 month ago

MiDashengLM-7B-1021

4 items • Updated Oct 27, 2025 • 2

DeepSeek-V4

4 items • Updated Apr 24 • 672