Grounding DINO 1.5 Practice!!

🦖 Grounding DINO 1.5 Practice!!

Previously, with Grounding DINO, we performed object detection using flexible prompts such as single words or whole sentences.

Today, we'll practice with the next version: Grounding DINO 1.5!
The previous version was open source, so we could run it ourselves.
Starting from version 1.5, however, the IDEA-Research team has opted not to open source the model.

Instead, the model is offered through DeepDataSpace, the official platform, either on the website or via an API.
This suggests that the Grounding DINO 1.5 work may not be aimed at submission to academic conferences.


🧱 1. Accessing and Registering on the Platform

  • Go to DeepDataSpace and sign up!
  • Besides China-based OAuth options like WeChat, Google OAuth is also available.
  • Upon registering, you get a 20 yuan credit (~5,000 KRW), which is more than enough for testing!

    Image

  • You can also check out their official API documentation!
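Since the model is only reachable through the platform, calling it from code means sending the image and prompt over HTTP. The block below is a minimal sketch of how such a request body could be prepared. It is not the documented DeepDataSpace schema: the field names (`model`, `image`, `prompt`), the data-URL image encoding, and the model name are all hypothetical placeholders, so check the official API documentation for the real ones. No network call is made here.

```python
import base64
import json


def build_detection_request(image_bytes: bytes, prompt: str, model: str) -> str:
    """Serialize one detection request as a JSON string.

    NOTE: the keys "model", "image", and "prompt" are hypothetical
    placeholders, not the documented DeepDataSpace request schema.
    """
    # JSON cannot carry raw bytes, so the image is base64-encoded first.
    encoded = base64.b64encode(image_bytes).decode("ascii")
    payload = {
        "model": model,  # hypothetical model identifier
        "image": "data:image/jpeg;base64," + encoded,  # assumed encoding
        "prompt": prompt,
    }
    return json.dumps(payload)


# Example with stand-in bytes instead of a real JPEG file.
body = build_detection_request(b"not a real jpeg", "bottle", "hypothetical-model-name")
print(body[:60])
```

In a real script the returned string would be sent as the POST body together with the API token from your account page, using whatever endpoint and header name the official documentation specifies.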

📦 2. Testing DINO on the Site

I tested the model directly on their playground!
They offer the publicly announced 1.5 Pro and Edge models, and even a 1.6 version is available!

Image

I wanted to see whether it could pick out a part inside an object better than before,
so I tested with an image of a baseball bat and used the prompt:
"handle of baseballbat"

Image

Result? It didn't differ much from the older version...

Image

Oh well~~
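"Didn't differ much" was an eyeball judgment here. If you want a number, one common option is the intersection over union (IoU) between the box the old model returns and the box the new one returns for the same prompt. The sketch below assumes boxes are given as `(x1, y1, x2, y2)` pixel corners; the sample coordinates are made up for illustration and are not results from my test.

```python
def iou(box_a, box_b):
    """Intersection over union of two (x1, y1, x2, y2) boxes."""
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b

    # Width and height of the overlap, clamped to zero when the boxes are disjoint.
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h

    area_a = (ax2 - ax1) * (ay2 - ay1)
    area_b = (bx2 - bx1) * (by2 - by1)
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


# Hypothetical boxes from two model versions for the same prompt.
old_box = (40, 60, 400, 120)
new_box = (42, 58, 398, 122)
print(f"IoU between versions: {iou(old_box, new_box):.3f}")
```

An IoU close to 1 means both versions drew nearly the same box, which would match the impression that the whole bat, not just the handle, was detected in both cases.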

I tried more prompts to see what else it can detect well, and honestly, it handles single-word prompts very effectively!

baby drinking water
Image

bottle
Image

chair
Image

cap
Image

man with short sleeves (still weak with full-sentence prompts!)
Image

child
Image

photo frame
Image
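I ran the prompts above one at a time. The open-source Grounding DINO also accepts several categories in a single caption, written in lowercase and separated by periods (for example `chair . person . dog .`). The helper below is a small sketch that builds such a caption from a list. Whether the hosted 1.5 models expect exactly the same separator is an assumption on my part, and `build_caption` is just an illustrative name.

```python
def build_caption(categories):
    """Join category names into a period-separated caption.

    Follows the prompt convention of the open-source Grounding DINO;
    it is assumed, not confirmed, that the hosted 1.5 API accepts it too.
    """
    # Normalize: trim whitespace, lowercase, and drop empty entries.
    cleaned = [c.strip().lower() for c in categories if c.strip()]
    if not cleaned:
        return ""
    return " . ".join(cleaned) + " ."


prompts = ["bottle", "chair", "cap", "child", "photo frame"]
print(build_caption(prompts))
```

Batching the single-word prompts this way means one request instead of five, which also stretches the free credit further.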


🎉 Conclusion

As I already felt with the original Grounding DINO,
open-set detection, where you can use free-form prompts, is incredibly powerful!
While it's unfortunate that you can't download and run the model because it is closed,
on the flip side, testing it and using it through an API is super convenient.
And the cost doesn't seem too high either!

Here's to hoping more models like this become open source in the future.
Jiayou!


This post is licensed under CC BY 4.0 by the author.