Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2411.03884

Polynomial Implicit Neural Representations For Large Diverse Datasets

Paper • 2303.11424 • Published Mar 20, 2023
The Spectral Bias of Polynomial Neural Networks

Paper • 2202.13473 • Published Feb 27, 2022
Categories of Differentiable Polynomial Circuits for Machine Learning

Paper • 2203.06430 • Published Mar 12, 2022
Learning Hierarchical Polynomials with Three-Layer Neural Networks

Paper • 2311.13774 • Published Nov 23, 2023

Full Paper List

Ultra-Sparse Memory Network

Paper • 2411.12364 • Published Nov 19, 2024 • 23
Hyper-Connections

Paper • 2409.19606 • Published Sep 29, 2024 • 24
Polynomial Composition Activations: Unleashing the Dynamics of Large Language Models

Paper • 2411.03884 • Published Nov 6, 2024 • 28
Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling

Paper • 2501.16975 • Published Jan 28 • 31

This collection is meant for RAG articles 1. Let your LLM generate a few tokens https://www.arxiv.org/abs/2412.11536

GraphReader: Building Graph-based Agent to Enhance Long-Context Abilities of Large Language Models

Paper • 2406.14550 • Published Jun 20, 2024 • 4
Mixture-of-Agents Enhances Large Language Model Capabilities

Paper • 2406.04692 • Published Jun 7, 2024 • 60
Meta Prompting for AGI Systems

Paper • 2311.11482 • Published Nov 20, 2023 • 4
Symbolic Learning Enables Self-Evolving Agents

Paper • 2406.18532 • Published Jun 26, 2024 • 12

ByteDance Papers

ByteDance papers collection

Contrastive Learning for Many-to-many Multilingual Neural Machine Translation

Paper • 2105.09501 • Published May 20, 2021
Cross-modal Contrastive Learning for Speech Translation

Paper • 2205.02444 • Published May 5, 2022
ByteTransformer: A High-Performance Transformer Boosted for Variable-Length Inputs

Paper • 2210.03052 • Published Oct 6, 2022
Diffusion Glancing Transformer for Parallel Sequence to Sequence Learning

Paper • 2212.10240 • Published Dec 20, 2022 • 1

What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective

Paper • 2410.23743 • Published Oct 31, 2024 • 63
Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level

Paper • 2411.03562 • Published Nov 5, 2024 • 68
Polynomial Composition Activations: Unleashing the Dynamics of Large Language Models

Paper • 2411.03884 • Published Nov 6, 2024 • 28
MM-IQ: Benchmarking Human-Like Abstraction and Reasoning in Multimodal Models

Paper • 2502.00698 • Published Feb 2 • 24

Polynomial Implicit Neural Representations For Large Diverse Datasets

Paper • 2303.11424 • Published Mar 20, 2023
The Spectral Bias of Polynomial Neural Networks

Paper • 2202.13473 • Published Feb 27, 2022
Categories of Differentiable Polynomial Circuits for Machine Learning

Paper • 2203.06430 • Published Mar 12, 2022
Learning Hierarchical Polynomials with Three-Layer Neural Networks

Paper • 2311.13774 • Published Nov 23, 2023

ByteDance Papers

ByteDance papers collection

Contrastive Learning for Many-to-many Multilingual Neural Machine Translation

Paper • 2105.09501 • Published May 20, 2021
Cross-modal Contrastive Learning for Speech Translation

Paper • 2205.02444 • Published May 5, 2022
ByteTransformer: A High-Performance Transformer Boosted for Variable-Length Inputs

Paper • 2210.03052 • Published Oct 6, 2022
Diffusion Glancing Transformer for Parallel Sequence to Sequence Learning

Paper • 2212.10240 • Published Dec 20, 2022 • 1

Full Paper List

Ultra-Sparse Memory Network

Paper • 2411.12364 • Published Nov 19, 2024 • 23
Hyper-Connections

Paper • 2409.19606 • Published Sep 29, 2024 • 24
Polynomial Composition Activations: Unleashing the Dynamics of Large Language Models

Paper • 2411.03884 • Published Nov 6, 2024 • 28
Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling

Paper • 2501.16975 • Published Jan 28 • 31

What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective

Paper • 2410.23743 • Published Oct 31, 2024 • 63
Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level

Paper • 2411.03562 • Published Nov 5, 2024 • 68
Polynomial Composition Activations: Unleashing the Dynamics of Large Language Models

Paper • 2411.03884 • Published Nov 6, 2024 • 28
MM-IQ: Benchmarking Human-Like Abstraction and Reasoning in Multimodal Models

Paper • 2502.00698 • Published Feb 2 • 24

This collection is meant for RAG articles 1. Let your LLM generate a few tokens https://www.arxiv.org/abs/2412.11536

GraphReader: Building Graph-based Agent to Enhance Long-Context Abilities of Large Language Models

Paper • 2406.14550 • Published Jun 20, 2024 • 4
Mixture-of-Agents Enhances Large Language Model Capabilities

Paper • 2406.04692 • Published Jun 7, 2024 • 60
Meta Prompting for AGI Systems

Paper • 2311.11482 • Published Nov 20, 2023 • 4
Symbolic Learning Enables Self-Evolving Agents

Paper • 2406.18532 • Published Jun 26, 2024 • 12

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs