'2025/09/18 글 목록

Notice

Recent Posts

Recent Comments

Link

« 2025/09 »
일	월	화	수	목	금	토
	1	2	3	4	5	6
7	8	9	10	11	12	13
14	15	16	17	18	19	20
21	22	23	24	25	26	27
28	29	30

Tags more

Archives

Today

Total

관리 메뉴

글쓰기
방명록
RSS
관리

목록2025/09/18 (3)

토니의 연습장

Scaling - file formats

언어 AI (NLP)/LLM & RAG & Agent 2025. 9. 18. 17:40

GPT/Llama 아키텍처

출처 : https://github.com/rasbt/LLMs-from-scratch/tree/main/ch05/07_gpt_to_llama

AI 일반/모델, 아키텍처, 구현 2025. 9. 18. 10:57

LLM train/eval/generate 간단한 예시

train 하면서 주기적으로 evaluation 하고 sample text 를 generation학습하면서 수치적 평가를 병행하며, 동시에 생성 샘플을 통해 성능을 직관적으로 확인해 볼 수 있게 함 def train_model_simple(model, train_loader, val_loader, optimizer, device, num_epochs, eval_freq, eval_iter, start_context, tokenizer): # Initialize lists to track losses and tokens seen train_losses, val_losses, track_tokens_seen = [], [], [] tokens_seen,..

AI 일반/모델, 아키텍처, 구현 2025. 9. 18. 10:36

이전 Prev 1 Next 다음

목록2025/09/18 (3)

토니의 연습장

티스토리툴바