-
Polynomial Implicit Neural Representations For Large Diverse Datasets
Paper • 2303.11424 • Published -
The Spectral Bias of Polynomial Neural Networks
Paper • 2202.13473 • Published -
Categories of Differentiable Polynomial Circuits for Machine Learning
Paper • 2203.06430 • Published -
Learning Hierarchical Polynomials with Three-Layer Neural Networks
Paper • 2311.13774 • Published
Collections
Discover the best community collections!
Collections including paper arxiv:2411.03884
-
Ultra-Sparse Memory Network
Paper • 2411.12364 • Published • 23 -
Hyper-Connections
Paper • 2409.19606 • Published • 24 -
Polynomial Composition Activations: Unleashing the Dynamics of Large Language Models
Paper • 2411.03884 • Published • 28 -
Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling
Paper • 2501.16975 • Published • 31
-
GraphReader: Building Graph-based Agent to Enhance Long-Context Abilities of Large Language Models
Paper • 2406.14550 • Published • 4 -
Mixture-of-Agents Enhances Large Language Model Capabilities
Paper • 2406.04692 • Published • 60 -
Meta Prompting for AGI Systems
Paper • 2311.11482 • Published • 4 -
Symbolic Learning Enables Self-Evolving Agents
Paper • 2406.18532 • Published • 12
-
Contrastive Learning for Many-to-many Multilingual Neural Machine Translation
Paper • 2105.09501 • Published -
Cross-modal Contrastive Learning for Speech Translation
Paper • 2205.02444 • Published -
ByteTransformer: A High-Performance Transformer Boosted for Variable-Length Inputs
Paper • 2210.03052 • Published -
Diffusion Glancing Transformer for Parallel Sequence to Sequence Learning
Paper • 2212.10240 • Published • 1
-
What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective
Paper • 2410.23743 • Published • 63 -
Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level
Paper • 2411.03562 • Published • 68 -
Polynomial Composition Activations: Unleashing the Dynamics of Large Language Models
Paper • 2411.03884 • Published • 28 -
MM-IQ: Benchmarking Human-Like Abstraction and Reasoning in Multimodal Models
Paper • 2502.00698 • Published • 24
-
Polynomial Implicit Neural Representations For Large Diverse Datasets
Paper • 2303.11424 • Published -
The Spectral Bias of Polynomial Neural Networks
Paper • 2202.13473 • Published -
Categories of Differentiable Polynomial Circuits for Machine Learning
Paper • 2203.06430 • Published -
Learning Hierarchical Polynomials with Three-Layer Neural Networks
Paper • 2311.13774 • Published
-
Contrastive Learning for Many-to-many Multilingual Neural Machine Translation
Paper • 2105.09501 • Published -
Cross-modal Contrastive Learning for Speech Translation
Paper • 2205.02444 • Published -
ByteTransformer: A High-Performance Transformer Boosted for Variable-Length Inputs
Paper • 2210.03052 • Published -
Diffusion Glancing Transformer for Parallel Sequence to Sequence Learning
Paper • 2212.10240 • Published • 1
-
Ultra-Sparse Memory Network
Paper • 2411.12364 • Published • 23 -
Hyper-Connections
Paper • 2409.19606 • Published • 24 -
Polynomial Composition Activations: Unleashing the Dynamics of Large Language Models
Paper • 2411.03884 • Published • 28 -
Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling
Paper • 2501.16975 • Published • 31
-
What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective
Paper • 2410.23743 • Published • 63 -
Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level
Paper • 2411.03562 • Published • 68 -
Polynomial Composition Activations: Unleashing the Dynamics of Large Language Models
Paper • 2411.03884 • Published • 28 -
MM-IQ: Benchmarking Human-Like Abstraction and Reasoning in Multimodal Models
Paper • 2502.00698 • Published • 24
-
GraphReader: Building Graph-based Agent to Enhance Long-Context Abilities of Large Language Models
Paper • 2406.14550 • Published • 4 -
Mixture-of-Agents Enhances Large Language Model Capabilities
Paper • 2406.04692 • Published • 60 -
Meta Prompting for AGI Systems
Paper • 2311.11482 • Published • 4 -
Symbolic Learning Enables Self-Evolving Agents
Paper • 2406.18532 • Published • 12