Hugging Face
Kashif Rasul
kashif
393 followers · 92 following
krasul
kashif
AI & ML interests
Time Series Forecasting, Denoising Diffusion, Generative Modeling, Reinforcement Learning
Recent Activity
Liked a Space about 12 hours ago: multimodalart/LLaDA-2-1
new
activity
1 day ago
kuleshov-group/bd3lm-owt-block_size16:
Add post_init() and register_buffer(persistent=False) for transformers v5
new
activity
1 day ago
kuleshov-group/bd3lm-owt-block_size8:
Add post_init() and register_buffer(persistent=False) for transformers v5
kashif's activity
New activity in
kuleshov-group/bd3lm-owt-block_size16
1 day ago
Add post_init() and register_buffer(persistent=False) for transformers v5
#3 opened 1 day ago by
kashif
New activity in
kuleshov-group/bd3lm-owt-block_size8
1 day ago
Add post_init() and register_buffer(persistent=False) for transformers v5
#2 opened 1 day ago by
kashif
Commented on a paper 1 day ago
LLaDA2.1: Speeding Up Text Diffusion via Token Editing
Paper • 2602.08676 • Published Feb 9 • 70 • 5
New activity in
kuleshov-group/bd3lm-owt-block_size4
1 day ago
Add post_init() call for transformers v5 compatibility
#2 opened 1 day ago by
kashif
New activity in
inclusionAI/LLaDA2.1-mini
8 days ago
fixes for transformers v5
#4 opened 12 days ago by
kashif
New activity in
inclusionAI/LLaDA2.0-mini-CAP
12 days ago
fix: align RotaryEmbedding and _init_weights with Qwen2Moe for transformers compat
#2 opened 12 days ago by
kashif
New activity in
inclusionAI/LLaDA2.1-flash
12 days ago
fix: align RotaryEmbedding with Qwen2Moe pattern for transformers compat
#4 opened 12 days ago by
kashif
New activity in
nyu-visionx/RAE-mae-base-p16-ViTXL-n08
26 days ago
Update weights: include latent normalization buffers for diffusers compatibility
#3 opened 26 days ago by
kashif
New activity in
nyu-visionx/RAE-siglip2-base-p16-i256-ViTXL-n08
26 days ago
Update weights: include latent normalization buffers for diffusers compatibility
#4 opened 26 days ago by
kashif
New activity in
nyu-visionx/RAE-dinov2-wReg-large-ViTXL-n08
26 days ago
Update weights: include latent normalization buffers for diffusers compatibility
#4 opened 26 days ago by
kashif
New activity in
nyu-visionx/RAE-dinov2-wReg-small-ViTXL-n08
26 days ago
Update weights: include latent normalization buffers for diffusers compatibility
#3 opened 26 days ago by
kashif
New activity in
nyu-visionx/RAE-dinov2-wReg-base-ViTXL-n08-i512
26 days ago
Update weights: include latent normalization buffers for diffusers compatibility
#3 opened 26 days ago by
kashif
New activity in
nyu-visionx/RAE-dinov2-wReg-base-ViTXL-n08
26 days ago
Update weights: include latent normalization buffers for diffusers compatibility
#4 opened 26 days ago by
kashif
New activity in
google/timesfm-2.5-200m-transformers
29 days ago
updated config and weights
#3 opened 29 days ago by
kashif
Upload 2 files
#2 opened about 1 month ago by
kashif
New activity in
nyu-visionx/RAE-siglip2-base-p16-i256-ViTXL-n08
29 days ago
Remap encoder keys to match SiglipVisionModel key layout
#3 opened 29 days ago by
kashif
New activity in
nyu-visionx/RAE-dinov2-wReg-large-ViTXL-n08
29 days ago
Add encoder_num_hidden_layers=24 to config
#3 opened 29 days ago by
kashif
New activity in
nyu-visionx/RAE-mae-base-p16-ViTXL-n08
about 1 month ago
Update config for diffusers AutoencoderRAE refactor
#2 opened about 1 month ago by
kashif
New activity in
nyu-visionx/RAE-siglip2-base-p16-i256-ViTXL-n08
about 1 month ago
Update config for diffusers AutoencoderRAE refactor
#2 opened about 1 month ago by
kashif
New activity in
nyu-visionx/RAE-dinov2-wReg-large-ViTXL-n08
about 1 month ago
Update config for diffusers AutoencoderRAE refactor
#2 opened about 1 month ago by
kashif