AI Today27 Nov 2024

Model-Based Transfer Learning for Contextual Reinforcement Learning | #ai #mit #rl #genai #ml #2024

Paper: https://arxiv.org/pdf/2408.04498 This research introduces Model-Based Transfer Learning (MBTL), a novel framework for improving the efficiency and robustness of deep reinforcement learning (RL) in contextual Markov Decision Processes (CMDPs). MBTL strategically selects training tasks to maximize generalization performance across a range of tasks by modeling both the performance set point using Gaussian processes and the generalization gap as a function of contextual similarity. Theoretical analysis proves sublinear regret, and experiments on urban traffic and continuous control benchmarks demonstrate significant sample efficiency improvements (up to 50x) compared to traditional methods. The method's effectiveness is shown to be relatively insensitive to the underlying RL algorithm and hyperparameters. ai , model , mit, genai, generativeai, artificialintelligence , arxiv , research , paper , publication, reinforcement learning, rl , ml

Upptäck Premium

Prova 14 dagar kostnadsfritt

Skaffa Premium

Avsnitt(30)

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model | #ai #2025 #genai #google

Paper: https://arxiv.org/pdf/2501.17161 This research paper compares supervised fine-tuning (SFT) and reinforcement learning (RL) for post-training foundation models. Using novel and existing tasks i...

7 Feb 202516min

Deepseek Janus-Pro: Unified Multimodal Understanding and Generation | #ai #2025 #genai #deepseek

Paper: https://github.com/deepseek-ai/Janus/blob/main/janus_pro_tech_report.pdf Github: https://github.com/deepseek-ai/Janus/tree/main?tab=readme-ov-file The paper introduces Janus-Pro, an improved m...

30 Jan 202516min

Memory Layers at Scale | #ai #2024 #genai #meta

Paper: https://arxiv.org/pdf/2412.09764 This research paper explores the effectiveness of memory layers in significantly enhancing large language models (LLMs). By incorporating a trainable key-value...

11 Jan 202514min

Large Concept Models: Language Modeling in a Sentence Representation Space | #ai #2024 #genai

Paper: https://scontent-dfw5-1.xx.fbcdn.net/... This research paper introduces Large Concept Models (LCMs), a novel approach to language modeling that operates on sentence embeddings instead of indi...

6 Jan 202529min

DeepSeek v3 | #ai #2024 #genai

Technical Report: https://arxiv.org/pdf/2412.19437 Github: https://github.com/deepseek-ai/DeepSe... This research paper introduces DeepSeek-V3, a 671-billion parameter Mixture-of-Experts (MoE) large ...

31 Dec 202428min

VISION TRANSFORMERS NEED REGISTERS | #ai #2024 #genai #meta

Paper: https://arxiv.org/pdf/2309.16588 This research paper examines artifacts in vision transformer feature maps, specifically high-norm tokens appearing in non-informative image areas. The authors ...

30 Dec 202433min

Byte Latent Transformer: Scaling Language Models with Patches | #ai #2024 #genai

Paper: https://arxiv.org/pdf/2412.09871v1.pdf The paper introduces the Byte Latent Transformer (BLT), a novel large language model architecture that processes raw byte data without tokenization. BLT ...

27 Dec 202421min

CosyVoice 2: Scalable Streaming Speech Synthesis with Large Language Models | #ai #2024 #genai

This research paper introduces CosyVoice 2, an improved streaming speech synthesis model. Building upon its predecessor, CosyVoice 2 utilizes advancements in large language models (LLMs) and incorpora...

27 Dec 202420min

Premium

99 kr/ månad

Tillgång till alla Premium-poddar
Reklamfritt premium-innehåll
Avsluta när du vill

Prova 14 dagar gratis

Premium

129 kr/ månad

Tillgång till alla Premium-poddar
Reklamfritt premium-innehåll
Avsluta när du vill
Ett extra konto

Prova 14 dagar gratis

Model-Based Transfer Learning for Contextual Reinforcement Learning | #ai #mit #rl #genai #ml #2024

Upptäck Premium

Avsnitt(30)

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model | #ai #2025 #genai #google

Deepseek Janus-Pro: Unified Multimodal Understanding and Generation | #ai #2025 #genai #deepseek

Memory Layers at Scale | #ai #2024 #genai #meta

Large Concept Models: Language Modeling in a Sentence Representation Space | #ai #2024 #genai

DeepSeek v3 | #ai #2024 #genai

VISION TRANSFORMERS NEED REGISTERS | #ai #2024 #genai #meta

Byte Latent Transformer: Scaling Language Models with Patches | #ai #2024 #genai

CosyVoice 2: Scalable Streaming Speech Synthesis with Large Language Models | #ai #2024 #genai

Allt en och samma app

Noga utvalt innehåll

Fortsätt när du vill

Premium

Premium

Populärt inom Teknik

Berättelserna och rösterna du älskar att lyssna på