Issues
is:issue state:open
[Proposal] Add Cohere2 / Command-A interleaved-attention adapter (Cohere2ForCausalLM)
Labels: complexity-moderate, good first issue, help wanted, new-architecture, TransformerBridge
Status: Open. #1474 in TransformerLensOrg/TransformerLens

[Proposal] Add BD3LM block-diffusion adapter (BD3LM)
Labels: complexity-high, help wanted, new-architecture, TransformerBridge
Status: Open. #1473 in TransformerLensOrg/TransformerLens

[Proposal] Add RecurrentGemma (Griffin) adapter (RecurrentGemmaForCausalLM)
Labels: complexity-high, help wanted, new-architecture, TransformerBridge
Status: Open. #1472 in TransformerLensOrg/TransformerLens

[Proposal] Add HRM-Text two-timescale recurrent adapter (HrmTextForCausalLM)
Labels: complexity-high, help wanted, new-architecture, TransformerBridge
Status: Open. #1471 in TransformerLensOrg/TransformerLens

[Proposal] Add Ouro LoopLM looped-depth adapter (OuroForCausalLM)
Labels: complexity-high, help wanted, new-architecture, TransformerBridge
Status: Open. #1470 in TransformerLensOrg/TransformerLens

[Proposal] Add Raven/Huginn depth-recurrent adapter (RavenForCausalLM)
Labels: complexity-high, help wanted, new-architecture, TransformerBridge
Status: Open. #1469 in TransformerLensOrg/TransformerLens

[Proposal] Add Jamba attention+Mamba interleave adapter (JambaForCausalLM)
Labels: complexity-moderate, help wanted, new-architecture, TransformerBridge
Status: Open. #1468 in TransformerLensOrg/TransformerLens

[Proposal] Add Arcee squared-ReLU sparse-MLP adapter (ArceeForCausalLM)
Labels: complexity-moderate, good first issue, help wanted, new-architecture, TransformerBridge
Status: Open. #1467 in TransformerLensOrg/TransformerLens

[Proposal] Add DeepSeek-V4 hybrid-attention MoE adapter (DeepseekV4ForCausalLM)
Labels: complexity-high, new-architecture, TransformerBridge
Status: Open. #1466 in TransformerLensOrg/TransformerLens

[Proposal] Add LLaDA masked-diffusion adapter (LLaDAModelLM)
Labels: complexity-high, help wanted, new-architecture, TransformerBridge
Status: Open. #1465 in TransformerLensOrg/TransformerLens

[Proposal] Add Qwen2-MoE shared+routed expert adapter (Qwen2MoeForCausalLM)
Labels: complexity-simple, good first issue, help wanted, new-architecture, TransformerBridge
Status: Open. #1464 in TransformerLensOrg/TransformerLens

[Proposal] Add Zamba2 shared-attention hybrid adapter (Zamba2ForCausalLM)
Labels: complexity-moderate, help wanted, new-architecture, TransformerBridge
Status: Open. #1463 in TransformerLensOrg/TransformerLens