Archives

2025

25 Dec 👀 Visual Attention Sink & VAR 논문 공부 (ICLR 2025)
24 Dec ⚡ StreamingLLM & Attention Sink 논문 공부 (ICLR 2024)
23 Dec ⚙️ Vision Transformers Need Registers 논문 공부 (ICLR 2025)
12 Dec 🔍LocalizationHeads - LVLM을 활용하여 Training-Free로 Segmentation 하기!! (CVPR 2025)
10 Nov 🔍양방향 학습을 통한 Affordance Grounding 문제해결! (ICCV 2025)
03 Nov 🔍 Contrastive Learning을 통한 Affordance Grounding 문제해결! (ICCV 2025)
12 Sep 🐍 Reasoning Mamba: Hypergraph 기반 추론으로 Weakly Supervised Affordance Grounding 강화!
11 Sep 🎭 MaskPrompt: 오픈 보캐뷸러리 Affordance Segmentation을 위한 객체 마스크 프롬프트
08 Sep 🖼️ Qwen2.5-VL: Next-Gen Vision-Language Model with Dynamic Resolution & Long Video Understanding
08 Sep 🎥 LAVAD: Training-Free Video Anomaly Detection with LLMs
08 Sep 📍 GEM: Grounding Everything in Vision-Language Transformers
07 Sep 🔎 VL-SAM Hands-on: VL-SAM 을 실습해보자!
06 Sep 🔎 VL-SAM: Training-Free Open-Ended Object Detection & Segmentation
05 Sep 🔎 CLIP Surgery: A Closer Look at the Explainability of Contrastive Language-Image Pre-training
05 Sep 🧩 PartCLIPSeg: Open-Vocabulary Part-level Segmentation with CLIP Guidance
04 Sep 🔎 ClipSurgery Hands-on: ClipSurgery 을 실습해보자!
02 Sep 🧠 SAM2 Hands-On Practice!! : SAM2 실습!! with Python
01 Sep 🔎 Open-Vocabulary SAM: Segment and Recognize Twenty-thousand Classes Interactively
28 Aug 🧩 Segment Anything, Even Occluded (SAMEO): 가려진 부분까지 세그멘트하는 SAM 확장
26 Aug 🧠 EfficientSAM Hands-On Practice!! : EfficientSAM 실습!! with Python
25 Aug 🧠 EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything — 실전형 SAM의 표준
24 Aug 🧩 RTMDet, SOTA of Real-Time, One-Stage Object Detectors: 실시간, One-Stage Object Detector의 정수
08 Aug 🎨 An Image is Worth One Word: Textual Inversion - 이미지를 `거시기` 화 해보리기!!
04 Aug 📝 TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering
02 Aug 📊 Evaluation Metrics in CIRR - CIRR분야의 Metrics 알아보기
01 Aug 📗 『교양으로 읽는 원자력 상식』을 읽고 - Reading “Nuclear Power Basics for the Culturally Curious”
31 Jul 🧠 OSrCIR: Reason-before-Retrieve for Composed Image Retrieval
30 Jul 🧠 [CIReVL] VISION-BY-LANGUAGE FOR TRAINING-FREE COMPOSITIONAL IMAGE RETRIEVAL : First Training Free on CIRR
29 Jul 🧠 CIRCO - Zero-Shot Composed Image Retrieval with Textual Inversion (ICCV 2023)
28 Jul 🧠 CIR - Composed Image Retrieval on Real-life Images : 이미지 탐색의 시작연구!!
26 Jul 🧠 FashionIQ - Fashion Image Retrieval with Natural Language: 패션 이미지 검색의 새로운 표준
25 Jul 👁️ MLLMs Know Where to Look: Training-free Visual Detail Perception
21 Jul 🧠 Notes-guided MLLM Reasoning
20 Jul 🚀Understanding Wikidata -무료료 wiki 데이터 활용하기!!
16 Jul 🧠Lost in the Middle - 긴 문맥에서 언어모델이 진짜 정보를 기억할까?
09 Jul 🔍 WSAG-PLSP: Weakly Supervised 학습을 통한 Affordance Grounding 문제해결!
08 Jul 🔍 WSMA: Multimodal Weak Supervision으로 Egocentric Affordance Grounding 혁신!
07 Jul 📌 LOCATE: Weakly Supervised Affordance Grounding을 위한 Object Part Localization & Transfer
05 Jul [MLflow] LLM 프롬프트 엔지니어링 실험 관리하기 - 체계적인 프롬프트 튜닝과 결과 추적
05 Jul 🚀Understanding MLflow -MLOps의 필수 도구 MLflow 알아보기?!!
02 Jul 🚀 Transformer 파이썬으로 이해하기!
01 Jul 🔤Understanding Tokenizers - Tokenizer 알아보기?!!
25 Jun 🔗 Understanding GLIP - CLIP이해하기!!!
24 Jun 📝Understanding YOLO-World - 실시간 Open-Vocabulary Object Detection의 혁신!!!
23 Jun 📝Understanding CLIP4HOI - CLIP4HOI를 알아보자!!!
22 Jun 📘 『니코마코스 윤리학』을 읽고 - Reading “Nicomachean Ethics”
18 Jun 📝Understanding EZ-HOI - EZ-HOI 알아보기!!
17 Jun 📝Understanding CLIP-Adapter - CLIP-Adapter 알아보기?!!
14 Jun 📝Understanding YOLO - YOLO 알아보기?!!
13 Jun 🖥️ LoRA Hands-On Practice!! : LORA 실습!! with python
12 Jun 📝Understanding BLIP - BLIP 알아보기?!!
11 Jun 📝Understanding FG-CLip - FG-Clip 알아보기?!!
10 Jun 🖥️ FG-Clip Practice!! : FG-Clip 실습!! with python
09 Jun 📝 LoRA: Low-Rank Fine-Tuning for Large Language Models - Understanding LORA- LORA 알아보기?!!
08 Jun 🖥️ SEEM Practice!! - SEEM 실습!! with python. gradio
07 Jun 📝 Understanding SEEM - SEEM(Segment Everything Everywhere All at Once) 알아보기!!
06 Jun 🖥️ Grounding DINO 1.5 Practice!! - Grounding DINO 1.5 실습!!
05 Jun 📝 Understanding LISA - LISA 알아보기?!!
04 Jun 🖥️ LISA Practice!! - Reasoning Segmentation LLM LISA 실습!!
03 Jun 📘 Reading 『Why Stocks Go Up and Down』 - 『주식이 오르고 내리는 이유』를 읽고
29 May 🖥️ Video segmentation with Python using SAM2! - 파이썬 SAM2 실습 : 비디오에서 누끼따기!
25 May 📘 Reading 『The April 3rd Incident』 by Yuhua - 위화의 단편소설집 『4월3일사건』를 읽고
22 May 📘 Reading 『Quantum Studies with Kim』 - 『김상욱의 양자공부』를 읽고
17 May 📘 Reading 『The Accusation』 - 『고발』을 읽고
16 May 🧠 Understanding SAM2 - SAM2 알아보기?!!
15 May 📝 Understanding Grounding DINO!! - Grounding DINO 논문 공부!
15 May AI에서 'Ground'란 무엇인가? Grounding DINO, Grounding SAM, 그리고 Grounded Affordance까지!
14 May 🖥️ Grounded SAM Hands-On with Python! - Grounded SAM 실습 with python!
13 May 📘 Reading *Devenez votre propre psy* - 『마음의 기술』를 읽고
12 May 🖥️ Grounding DINO Practice - Grounding DINO 실습 with python!
11 May 🖥️ DINO Practice: Running Object Detection with Pretrained Models - DINO 실습: 모델을 받아 직접 객체 탐지 해보기!
10 May 오늘의 시 : 행복하다가 - A Poem to Share: While I'm Happy
09 May 📝 DINO: The Evolutionary Object Detection Model of DETR!! - DINO: DETR의 진화형 객체 탐지 모델!! (ICLR 2023)
08 May 🖥️ Object Detection with DETR! Python Practice!! - DETR을 활용한 객체 탐지! 파이썬 실습!!
07 May The Rise and Fall of UAE's Once-Prominent LLM, Falcon - 한때 주목받던 UAE의 LLM, 팰컨의 근황
07 May 📝 The First Transformer-based Image Detection Model!! DETR! - Transformer로 객채 탐지까지!! DETR의 등장!! (CVPR 2020)
06 May 🖥️ DINO Python Experiment!! Super Impressive!! - DINO 파이썬 실습!! 완전 신기해!!
05 May 🖥️ Image segmentation with Python using SAM! - 파이썬으로 누끼따기!? SAM (Segment Anything Model) 실습
04 May 📝 Segment Anything, You are amazing! - 누끼의 괴물, SAM의 등장!! (ICCV, 2023)
03 May 📘 A True Classic, Deserving Its Reputation - 명불허전, 『왕자와 거지』를 읽고
30 Apr 📝 ViT, you can do greater things! - The emergence of DINO!! // ViT, 너는 더 큰일을 할수있어! - DINO의 등장!! (ICCV 2021)
29 Apr The Relentless Rise of Chinese AI Models!! A Look at Qwen3!! - 끝없는 중국AI모델의 발전!! Qwen3 살펴보기!! 🇨🇳🚀
28 Apr 📘 행동으로 옮기기 힘든,, 『초역 부처의 말』을 읽고 - After reading 『The Buddha's Words: Super Translation』
27 Apr 🖥️ Image classification using ViT with Python - 파이썬으로 ViT 모델을 활용, 이미지 분류하기
26 Apr 📘 왜 서양이 글로벌 해게모를 잡게되었을까!? '창발의 시대'을 읽고 - Why Did the West Come to Dominate the World? Reading 『The Verge』
20 Apr 히가시노 게이고의 '회랑정 살인사건'을 읽고 - A Review of 'The Murder in the Corridor Pavilion' by Keigo Higashino
18 Apr 🖥️ Studying CAM with Python! - 파이썬으로 CAM 공부하기
17 Apr 📝 Peeking into the Mind of AI: Understanding CAM! - AI의 속마음을 들여다본다!! CAM 알아보기
14 Apr 📘 김훈 작가의 '흑산(黑山)'를 읽고 - Reading *Heuksan (Black Mountain)* by Kim Hoon
13 Apr Howard Marks' Memo Reading “Nobody Knows (Yet Again)” - 하워드 막스의 편지'Nobody Knows'를 읽고
11 Apr 📘 Reading 'The Power of Money' by Paul Sheard - 폴 시어드의 『돈의 권력』을 읽고
09 Apr Learn How to Write Markdown Files - Markdown 언어로 글쓰는 방법 정리!!
08 Apr Using CLIP with Python - 파이썬으로 CLIP을 사용해보기
07 Apr On April 5th, 2025, Meta unveiled their next-gen multimodal AI model — Llama 4! 🦙🚀 - Meta에서 Llama 4 모델 공개!
06 Apr 📝 Understanding CLIP - CLIP 모델 이해하기
05 Apr Exploring Major Journals in AI - AI와 관련된 주요 저널 알아보기 (feat. h-index)
31 Mar Newly upgraded Sora, now writes well too. - 새로 업그레이드된 소라, 글씨도 잘 써요
29 Mar 맥쿼리인프라의 제 26기 주주총회 참관기 - Review of Attending the Meeting of Shareholders of Macquarie Korea Infrastructure
24 Mar 📝 Image? You Can Do Transformer Too!! - The Emergence of ViT!! - 이미지? 너도 Transformer 할수있어!! - ViT의 등장!! (ICLR 2021)
23 Mar DrFirsts blog has been launched! - 일등박사의 블로그가 개설되었습니다!

Archives

Trending Tags