Page 111 of 130 • 1560 Total Articles

createLiveAI

Continue exploring the latest AI breakthroughs, technology insights, and industry analysis. Page 111 of our comprehensive AI news collection.

📰 Latest Intelligence

Showing 12 articles on page 111 of 130

Live feed
Research
📄 arXiv Machine Learning

Decom-Renorm-Merge: Model Merging on the Right Space Improves Multitasking

A new method called Decom-Renorm-Merge (DRM) has been proposed to improve neural network model merging by using Singular Value Decomposition to align weight matrices into a joint space, enabling more effective merging across various architectures. DRM outperforms existing techniques in both full finetuning and low-rank adaptation scenarios, with renormalization identified as a key factor for creating a robust merging process.

research machine-learning +1
Read More
Research
📄 arXiv Machine Learning

DeepRTE: Pre-trained Attention-based Neural Network for Radiative Tranfer

Researchers introduced DeepRTE, a neural network method utilizing pre-trained attention mechanisms to accurately and efficiently solve the steady-state Radiative Transfer Equation, which models radiation propagation in various scientific fields. Numerical experiments demonstrate the approach's high accuracy and computational benefits across applications like atmospheric transfer, heat transfer, and optical imaging.

Deep Learning Transformers
Read More
Research
📄 arXiv Machine Learning

Defining Foundation Models for Computational Science: A Call for Clarity and Rigor

This paper highlights the need for a clear, formal definition of foundation models in computational science, emphasizing core qualities like generality, reusability, and scalability. It introduces the Data-Driven Finite Element Method (DD-FEM), which combines traditional numerical methods with data-driven learning to address challenges such as scalability and physics consistency, providing a foundation for future development in the field.

Machine Learning Computer Vision +1
Read More
Research
📄 arXiv Machine Learning

>-

A new method called DenoiseRotator improves pruning of large language models by redistributing parameter importance through learnable orthogonal transformations, making models more robust to pruning under semi-structured sparsity. Evaluations on models like LLaMA3 and Qwen2.5 show significant reductions in performance degradation, with the approach enhancing perplexity and zero-shot accuracy compared to traditional pruning techniques.

research machine-learning
Read More
Research
📄 arXiv Machine Learning

DINGO: Constrained Inference for Diffusion LLMs

Diffusion LLMs are a promising, efficient alternative to autoregressive models but struggle to enforce formal constraints like regular expressions, limiting their reliability for structured output tasks. To address this, the authors propose DINGO, a dynamic programming-based decoding method that efficiently and provably preserves the model's distribution while strictly satisfying user-defined constraints, significantly improving constrained generation performance.

research llm +1
Read More
Research
📄 arXiv Machine Learning

Directed Graph Grammars for Sequence-based Learning

This work introduces a grammar-based method to represent directed acyclic graphs (DAGs) as unique, sequential derivations over an unambiguous grammar, enabling lossless compression and principled decoding. This compact representation facilitates applications such as graph generation, property prediction, and Bayesian optimization by providing a continuous sequence-based structure for complex DAGs.

research machine-learning
Read More
Research
📄 arXiv Machine Learning

Efficiently Access Diffusion Fisher: Within the Outer Product Span Space

This paper reveals that the diffusion Fisher information in diffusion models can be efficiently approximated using outer products of the score and initial data, rather than relying on computationally intensive auto-differentiation. The proposed algorithms offer improved accuracy and reduced computational cost, with applications demonstrated in likelihood evaluation, optimization, and verifying properties of diffusion maps.

Page 111 of 130 • Showing articles 1321-1332 of 1560