Master's Seminar Machine Learning
summer semester 2025
Organizational:
- Talk: 30 Min + 10 min discussion
- Handout: 2 pages
- Hand in slides 2 weeks before presentation so that we can go through
Papers for the Machine Learning Seminar
Large Language Models:
- Eureka: Human-Level Reward Design via Coding Large Language Models https://arxiv.org/abs/2310.12931
- Auto MC-Reward: Automated Dense Reward Design with Large Language Models for Minecraft https://arxiv.org/abs/2312.09238
- DeepSeek v2 https://arxiv.org/pdf/2405.04434
- DeepSeek v3 https://arxiv.org/abs/2412.19437
- DeepSeek R1 https://arxiv.org/abs/2501.12948
- Chain-of-Thought Prompting Elicits Reasoning in Large Language Models https://arxiv.org/abs/2201.11903
- Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizershttps://arxiv.org/abs/2309.08532
- Language Models Don't Always Say What They Think: Unfaithful Explanations in Chain-of-Thought Prompting https://arxiv.org/abs/2305.04388
- Large Language Models Are Human-Level Prompt Engineers https://arxiv.org/abs/2211.01910
- Automatic Prompt Optimization with "Gradient Descent" and Beam Search https://arxiv.org/abs/2305.03495
- Rho-1: Not All Tokens Are What You Need https://arxiv.org/abs/2404.07965
- s1: Simple test-time scaling h ttps://arxiv.org/abs/2501.19393
- Finetuned Language Models Are Zero-Shot Learners https://arxiv.org/abs/2109.01652
Fine-Tuning:
- LoRA: Low-Rank Adaptation of Large Language Models https://arxiv.org/abs/2106.09685
- QLoRA: Efficient Finetuning of Quantized LLMs https://arxiv.org/abs/2305.14314
3D Deep Learning:
- Neural Radiance Fieldshttps://arxiv.org/abs/2003.08934
- pixelNeRF: Neural Radiance Fields from One or Few Images https://arxiv.org/abs/2012.02190
- Putting NeRF on a Diet: Semantically Consistent Few-Shot View Synthesis https://arxiv.org/abs/2104.00677
- Learned Initializations for Optimizing Coordinate-Based Neural Representations https://arxiv.org/abs/2012.02189
- FreeNeRF: Improving Few-shot Neural Rendering with Free Frequency Regularization https://arxiv.org/abs/2303.07418
- SinNeRF: Training Neural Radiance Fields on Complex Scenes from a Single Image https://arxiv.org/abs/2204.00928
- 3D Gaussian Splatting for Real-Time Radiance Field Rendering https://repo-sam.inria.fr/fungraph/3d-gaussian-splatting/
ARC Challenge:
- Graphs, Constraints, and Search for the Abstraction and Reasoning Corpus https://arxiv.org/abs/2210.09880
- LLMs and the Abstraction and Reasoning Corpus: Successes, Failures, and the Importance of Object-based Representations https://arxiv.org/abs/2305.18354
- Tackling the Abstraction and Reasoning Corpus (ARC) with Object-centric Models and the MDL Principle https://arxiv.org/abs/2311.00545
- CodeIt: Self-Improving Language Models with Prioritized Hindsight Replay https://arxiv.org/abs/2402.04858
- Getting 50% (SoTA) on ARC-AGI with GPT-4o https://redwoodresearch.substack.com/p/getting-50-sota-on-arc-agi-with-gpt
- Combining Induction and Transduction for Abstract Reasoning https://arxiv.org/abs/2411.02272
- The LLM ARChitect: Solving ARC-AGI Is A Matter of Perspective https://da-fr.github.io/arc-prize-2024/the_architects.pdf
- The Surprising Effectiveness of Test-Time Training for Abstract Reasoning https://arxiv.org/abs/2411.07279
- CompressARC: https://iliao2345.github.io/blog_posts/arc_agi_without_pretraining/arc_agi_without_pretraining.html
- Searching Latent Program Spaces https://arxiv.org/abs/2411.08706
Interesting facts about Neural Networks:
- Intriguing Properties of Neural Networks: h ttps://arxiv.org/abs/1312.6199
- The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks: https://arxiv.org/abs/1803.03635
- Neural Tangent Kernel: https://arxiv.org/abs/1806.07572
Other:
- Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction: https://arxiv.org/abs/2404.02905
- Group Equivariant Convolutional Networks https://arxiv.org/abs/1602.07576
- Deep Sets https://arxiv.org/abs/1703.06114
- Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos https://arxiv.org/abs/2206.11795
- Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and Memory https://arxiv.org/abs/2305.17144
- Voyager: An Open-Ended Embodied Agent with Large Language Models https://arxiv.org/abs/2305.16291
- First return, then explore https://arxiv.org/abs/2004.12919
- Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm https://arxiv.org/abs/1712.01815
- Discovering faster matrix multiplication algorithms with reinforcement learning https://www.nature.com/articles/s41586-022-05172-4