DataSci Ocean

An Introduction to the Integer Cache in Python

With a = 0 and b = 0, a is b; with a = 257 and b = 257, a is b can be False

Oct 3．5 min read．Python Basics Tutorial

[Paper Overview] Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs

A thorough look at how to train a Vision-Language Model with strong "visual capability"

Jul 29．30 min read．Paper Overview

[Paper Overview] Better & Faster Large Language Models via Multi-token Prediction

Who says an LLM has to predict one token at a time? Why not predict several?

Jul 18．8 min read．Paper Overview

[Paper Overview] Tuning LayerNorm in Attention: Towards Efficient Multi-Modal LLM Finetuning

Train fewer than 1% of the parameters and turn an LLM into a Multimodal LLM

Jul 8．8 min read．Paper Overview

[Paper Overview] GAIA: A Benchmark for General AI Assistants

GAIA: measuring whether your Agent counts as a General Assistant!

Jun 27．7 min read．Paper Overview

[Paper Overview] ChatEval: Towards Better LLM-Based Evaluators Through Multi-Agent Debate

What is an LLM Agent? How do Agents debate one another to complete a task?

Jun 23．15 min read．Paper Overview

[Paper Overview] Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM

BTX from Meta FAIR: training LLMs to master multiple domains more efficiently

Apr 24．15 min read．Paper Overview

[Paper Overview] Sparse Upcycling

Learn how to convert a Dense Model into a Sparse MoE

Apr 10．8 min read．Paper Overview
