Pull requests: microsoft/onnxruntime-genai
Pull requests list
[v4-pkg W2-stub] Add ModelPackageContext stub directory walker
#2132
opened May 6, 2026 by
xiaoyu-work
Contributor
•
Draft
[v4-pkg W5c-prep] Extract EffectiveSessionOptions helper for non-decoder components
#2131
opened May 6, 2026 by
xiaoyu-work
Contributor
•
Draft
[Qwen3.5] Use LpNormalization for L2-norm in linear-attention Q/K
#2127
opened May 6, 2026 by
xiaofeihan1
Contributor
3 tasks done
Add Config::shared_assets_path for shared tokenizer/processor assets
#2126
opened May 6, 2026 by
xiaoyu-work
Contributor
Add JSON DOM and RFC 7386 merge_patch to src/json
#2125
opened May 6, 2026 by
xiaoyu-work
Contributor
Fix multimodal CUDA pipeline: embedding output persistence causes shape mismatch
#2123
opened May 5, 2026 by
justinchuby
Contributor
Pipeline-as-Config: Declarative model dispatch replacing model_type string registry
#2115
opened May 2, 2026 by
justinchuby
Contributor
•
Draft
Fix AppendNextTokensToSequences heap overflow
#2111
opened Apr 30, 2026 by
apsonawane
Contributor
Enable graph capture for WebGPU models and DML continuous decoding tests
#2099
opened Apr 24, 2026 by
qjia7
Contributor
[Don't merge it] Fix quark quantize weight loading for Qwen3-VL-4B text model
#2082
opened Apr 13, 2026 by
Tianping-amd
extend modelbuilder to build Olmo3, SmolLM3 and other models
#2078
opened Apr 10, 2026 by
xadupre
Member