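import os
import urllib.request
from urllib.parse import quote

from bs4 import BeautifulSoup
from selenium import webdriver

# Run Chrome headless so the crawler also works on machines without a display
options = webdriver.ChromeOptions()
options.add_argument('--headless')
options.add_argument('--no-sandbox')
options.add_argument('--disable-dev-shm-usage')

# chromedriver is looked up on PATH; Selenium 4 no longer accepts the
# driver path as a positional argument
wd = webdriver.Chrome(options=options)

# search_dict maps each search term to its label and must be defined before this loop
for search, answer in search_dict.items():
    save_dir = f'./images/{search}_{answer}'
    os.makedirs(save_dir, exist_ok=True)  # do not fail if the folder already exists

    # URL-encode the query so non-ASCII or special characters are sent correctly
    url = f"https://search.naver.com/search.naver?where=image&sm=tab_jum&query={quote(search)}"
    wd.get(url)

    # Scroll to the bottom repeatedly so lazily loaded images are added to the page
    for _ in range(800):
        wd.execute_script("window.scrollTo(0, document.body.scrollHeight);")

    # Parse the fully scrolled page and collect the image tags
    soup = BeautifulSoup(wd.page_source, 'html.parser')
    tags = soup.select("img._img")

    for i, tag in enumerate(tags):
        src = tag.get("src")
        if not src:
            continue  # skip <img> tags that carry no source URL
        file_path = f'{save_dir}/images{i}.jpg'
        urllib.request.urlretrieve(src, file_path)
        print(file_path)

# Close the browser once every search term has been processed
wd.quit()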
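
The fixed count of 800 scrolls in the loop above spends time scrolling even after the page has stopped loading new images, and it never pauses for them to arrive. A common alternative is to scroll until the page height stops growing. The sketch below is one way to do that. The function name `scroll_until_stable` and the `pause` and `max_rounds` parameters are made up for illustration, and it assumes the page loads more results simply by scrolling (no "more" button to click).

```python
import time


def scroll_until_stable(driver, pause=1.0, max_rounds=800):
    """Scroll to the bottom until the page height stops growing.

    `driver` is any object exposing Selenium's `execute_script` method.
    Returns the number of scrolls performed (at most `max_rounds`).
    """
    last_height = driver.execute_script("return document.body.scrollHeight")
    for rounds in range(1, max_rounds + 1):
        driver.execute_script("window.scrollTo(0, document.body.scrollHeight);")
        time.sleep(pause)  # give lazily loaded images time to arrive
        new_height = driver.execute_script("return document.body.scrollHeight")
        if new_height == last_height:
            # Nothing new was appended, so assume the end of the results
            return rounds
        last_height = new_height
    return max_rounds
```

In the crawler this would replace the `for ... in range(800)` loop with a single call such as `scroll_until_stable(wd)`. A `pause` that is too short can end the loop early on a slow connection, so the value is worth tuning.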



AI Inventors Development Story


AI Inventors History#2-1 "Personalcolor AI Development - Image Crawling (Naver/Google Image Crawling)"

1. What Is Crawling?

2. How Do You Crawl?

3. Detailed Crawling Methods
