Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up

All HF Hub posts

SeaWolf-AIย 
posted an update about 10 hours ago
view post
Post
1504
๐Ÿงฌ Darwin V6: Diagnostic-Guided Evolutionary Model Merging

We are releasing Darwin-31B-Opus โ€” a reasoning-enhanced model merging Google's Gemma-4-31B-it and TeichAI's Claude Opus Distill using the Darwin V6 engine.

Model: FINAL-Bench/Darwin-31B-Opus
Demo: FINAL-Bench/Darwin-31B-Opus

๐Ÿ”ฌ What Darwin V6 Does

Conventional merging tools (mergekit, etc.) apply a single ratio to all tensors. Set ratio=0.5 and all 1,188 tensors blend identically, with no distinction between which tensors matter for reasoning versus coding.

Darwin V6 diagnoses both parents at the tensor level before merging. It measures Shannon entropy, standard deviation, and L2 norm for every tensor, then passes 5 diagnostic probes (REASONING, CODE, MATH, KNOWLEDGE, LANGUAGE) through the model to determine layer-wise functional importance. Each of the 1,188 tensors receives an independent optimal ratio.

combined = static(entropy/std/norm) x 0.4 + probe(cosine_distance) x 0.6
final_ratio = mri_ratio x mri_trust + genome_ratio x (1 - mri_trust)

When one parent is overwhelmingly superior for a tensor (ratio < 0.15 or > 0.85), Darwin transplants it directly without interpolation. The mri_trust parameter itself is optimized by CMA-ES evolutionary search, so optimal transplant intensity is determined automatically. After merging, a Health Check compares the child against both parents layer-by-layer to detect interference or function loss.

๐Ÿงฌ Parent Models
Father: google/gemma-4-31B-it
Mother: TeichAI/gemma-4-31B-it-Claude-Opus-Distill

๐Ÿงฌ Results
Compared under identical conditions (same 50 questions, same seed, greedy, thinking mode):
Father: 60.0% (30/50)
Darwin-31B-Opus: 66.0% (33/50) โ€” +10% relative improvement
ARC-Challenge: 82.89% (loglikelihood, zero-shot, 200 questions)
Optimal genome found by evolution:
ffn_ratio=0.93 โ€” FFN layers strongly favor Mother (Claude Opus Distill)
block_5 (L50-L59)=0.86 and more...
  • 7 replies
ยท
prithivMLmodsย 
posted an update 2 days ago
view post
Post
4448
The demo for Image Detection (*Filter) based on SAM3 and Qwen-3.5 is now available on Hugging Face Spaces using Transformers inference, with multimodal reasoning for processed images, and it also supports video segmentation (mask), video segmentation (annotation), and image click segmentation.

๐Ÿค— Demo Space: prithivMLmods/SAM3-Plus-Qwen3.5
๐Ÿฅฝ SAM3: facebook/sam3
๐Ÿ”— Qwen-3.5: Qwen/Qwen3.5-2B

To learn more, visit the app page or the respective model pages.
  • 5 replies
ยท
danielhanchenย 
posted an update about 20 hours ago
ArtelTalebย 
posted an update about 17 hours ago
view post
Post
1556
Splat Explorer - Image to 3D Mesh Studio

Upload up to 6 photos - multi-view input for accurate reconstruction No photos? No problem- type a prompt, FLUX.1-Schnell generates your reference images
AI vision pipeline - Qwen2.5-VL analyzes your angles and synthesizes the optimal 3D

description

Wireframe inspector - review topology before you export
GLB export - drop it straight into Blender, ZBrush, Maya, Unity, or Unreal

๐Ÿ”‘ Bring your own HF token. Nothing is stored server-side.

Works great as a starting mesh for retopology - pair it with [8VIEW AI
Studio]( ArtelTaleb/8view-ai-studio) to generate your character reference sheets first, then build the 3D asset here.

๐Ÿ‘‰ ArtelTaleb/splat-explorer



BibbyResearchย 
posted an update 1 day ago
view post
Post
1579
Bibby AI is now #3 rank and live on ProductHunt.

It's time to support the AI co-author for research you wish existed.

Producthunt - https://www.producthunt.com/products/bibby-ai

Please upvote, comment, give critical feedback.

The research community has shown immense trust.
  • 1 reply
ยท
Bilsย 
posted an update 3 days ago
view post
Post
2605
Avatars are everywhere, but here is the reality behind full-system marketing automation. ๐Ÿš€
Many see "Madame AI" simply as an AI news presenter. She is far deeper than that. Madame AI is a Real-time Agentic AI Assistant we developed to orchestrate entire workflows for marketing and professional media. She manages UGC (User-Generated Content), understands marketing system automation intuitively, and handles complex media tasks.
We have solved the character consistency and high production cost bottlenecks that traditionally required immense training and time. By precisely orchestrating every computational step behind videos and branded designs, we have fully automated the pipeline and significantly reduced costs.
This capability is built on our extensive experience managing large-scale automation projects with complex requirement documentation (PRD).
Grabclip is our public portal and the practical result of that journey. It is the interface where "Madame AI" acts as the intelligent engine.
We have spent three years building this pipeline with a clear goal: a 100% local, end-to-end solution that operates despite external restrictions.
See the live example on YouTube (our fast-paced AI news podcast with Madame AI) and try the automation portal yourself๐Ÿ‘‡
๐Ÿ“บ The Playlist: https://www.youtube.com/playlist?list=PLwEbW4bdYBSCVSziFfJYq4zXop_cyHquO
๐ŸŒ Our Portal (Grabclip) โ€” The first practical step in our pipeline: https://grabclip.bilsimaging.com/
hashtag#AgenticAI hashtag#VirtualInfluencer hashtag#FutureOfWork hashtag#GenerativeAI hashtag#TunisiaTech hashtag#MarketingAutomation hashtag#100PercentLocal hashtag#OSMedia hashtag#Grabclip hashtag#RealTimeAssistant hashtag#UGC hashtag#ProfessionalMedia hashtag#TunisiaAI
kanaria007ย 
posted an update 1 day ago
view post
Post
107
โœ… Article highlight: *Migrating Legacy LLM Systems to SI-Core* (art-60-055, v0.1)

TL;DR:
How do you migrate a legacy LLM stack without stopping delivery or rewriting everything?

This article lays out a practical path: start with *OBS + SIM/SIS* for traceability, add *EVAL + PoLB* for safe change control, then split *decision (Jump)* from *effect (RML)* so autonomy becomes governable rather than ad hoc.

Read:
kanaria007/agi-structural-intelligence-protocols

Why it matters:
โ€ข turns black-box agent behavior into traceable decision/effect flows
โ€ข makes prompt/tool changes governable through rollout and rollback discipline
โ€ข introduces compensators and effect boundaries before incidents force them on you
โ€ข gives teams a migration path without โ€œboil the oceanโ€ rewrites

Whatโ€™s inside:
โ€ข a step-by-step migration sequence:
*OBS + SIM/SIS โ†’ EVAL + PoLB โ†’ RML wrappers โ†’ Jump + ETH โ†’ GRP + Roles*
โ€ข concrete templates for OBS bundles, EvalSurface, PoLB rollout, Jump drafts, and RML effects
โ€ข anti-patterns to avoid during migration
โ€ข team structure, testing strategy, readiness gates, and exit criteria for becoming โ€œSI-native enoughโ€

Key idea:
You do not migrate to SI-Core by replacing everything at once.
You migrate by introducing *traceability, governed change, decision/effect separation, and replayable accountability* in that order.
ArtelTalebย 
posted an update 3 days ago
view post
Post
3481
HELLO 3D WORLD !

What if you could control a 3D model just by talking to it?

Not clicking. Not dragging sliders. Not writing animation code.
Justโ€ฆ describing what you want.

"Rotate slowly on the Y axis."
"Move forward, don't stop."
"Scale up, then reset."

That's the core idea behind Hello 3D World - a space I've been building
as an open experiment.
โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€ Here's how it works:

You load a 3D model. You describe it to the LLM
("this is a robot", "this is a hot air balloon").
Then you type a natural language command.

The LLM โ€” Qwen 72B, Llama 3, or Mistral - reads your intent
and outputs a JSON action: rotate, move, scale, loop, reset.
The 3D scene executes it instantly.

One model. One prompt. One action.

โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€

Why build this?

I'm genuinely curious where the limit is.

Today it's simple geometric commands. But what happens when
the model understands context? When it knows the object has
legs, or wings, or a cockpit? When it can choreograph a sequence
from a single sentence?

Maybe this becomes a prototyping tool for robotics.
Maybe a no-code animation layer for game dev.
Maybe something I haven't imagined yet.

That's why I'm keeping it open โ€” I want to see what
other people make it do.
โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€

The space includes:

โ†’ DR8V Robot + Red Balloon (more models coming)
โ†’ 5 lighting modes: TRON, Studio, Neon, Cel, Cartoon
โ†’ Import your own GLB / OBJ / FBX
โ†’ Built-in screen recorder
โ†’ Powered by open LLMs โ€” bring your own HF token

Record your best sequences and share them in the comments.
I want to see what this thing can do in other hands.

๐Ÿ”— ArtelTaleb/hello-3d-world
ArtelTalebย 
posted an update 4 days ago
view post
Post
4726
๐ŸŽต MP3 Player - Drop your music, hit play. No install

MP3 Player - brings that energy back - straight in your browser.

- Drop your files - MP3, WAV, FLAC, AAC, OGG, AIFF, WMA โ€” it reads them all
- Build your playlist - add tracks one by one or batch-load a whole folder
- Retro LCD display - scrolling track info, elapsed time, the full throwback
- Full controls - play, pause, skip, shuffle, repeat
- Mobile-first - big tactile buttons, works on phone like an iPod in your pocket

No install. No GPU needed on your end. Just upload and play.

๐Ÿ‘‰ ArtelTaleb/mp3-player

mike-ravkineย 
posted an update 4 days ago
view post
Post
1324
Gemma-4, specifically google/gemma-4-26B-A4B-it is doing something inside it's reasoning traces I have never seen before: it's recognizing that its being evaluated and spends meta-thinking tokens on understanding the evaluation regime in which it believes it find itself.

Let's see if 12/10/2023 is a more likely answer than 12/09/2023

In most AI benchmark tests (like those this prompt resembles), the simplest path is often the intended one.


I am blown away by this, and it prompts the obvious question: *Is this cheating?*

I am leaning towards no.

Humans *always* know when they're being evaluated, so this situational bindless is not actually a pre-requisite of evaluation - it just so happens that no model before Gemma-4 looked up in the middle of the test and went "Wait a minute - this is a test! I should try align my answer with the test format's expectations."

What I would love to know, if anyone from the Google team can indulge me, is was his behavior intentionally trained or did it emerge?
  • 3 replies
ยท