python Scrapy 초간단 사용법 정리

Notice

Recent Posts

Recent Comments

Link

« 2025/05 »
일	월	화	수	목	금	토
				1	2	3
4	5	6	7	8	9	10
11	12	13	14	15	16	17
18	19	20	21	22	23	24
25	26	27	28	29	30	31

Tags more

Archives

관리 메뉴

All thing of the world!

python Scrapy 초간단 사용법 정리 본문

IT/python

python Scrapy 초간단 사용법 정리

WorldSeeker 2023. 4. 29. 17:55

Python으로 작성된 Scrapy 사용법에 대해 정리한다.
(Not Scrappy! Scrapy!)
Scrapy란 웹스크래핑(Web Scrapping) 혹은 웹크롤링(Web Crawling)을 빠르고 안정적으로 대량 데이터를 쉽게 추출하기 위한 프레임워크다.

1. python에 Scrapy 설치

pip install scrapy

2. 터미널에서 Scrapy 프로젝트 생성

scrapy startproject <프로젝트명>
예) scrapy startproject testproject

3. 터미널에서 Scrapy spider(웹크롤러) 생성

scrapy genspider <spider명칭> <스크랩핑할 웹주소>
예) scrapy genspider testspider www.naver.com

※주의) 웹주소 기입시 "http://" 혹은 "https://" 없이 입력할 것

4. 터미널에서 Scrapy 웹스크래핑 실행

scrapy crawl <생성한 스파이더명>
예) scrapy crawl testspider

Just 4 step만으로도 testspider가 웹스크래핑을 시작한다.
물론 상세하게 들어가면 상당히 많은 것을 customizing(조정)해서 사용할 수 있다.
자세한 사용방법은 아래 scrapy 공식사이트의 영어원문을 참고하자.
https://scrapy.org/

Scrapy | A Fast and Powerful Scraping and Web Crawling Framework

Portable, Python written in Python and runs on Linux, Windows, Mac and BSD

scrapy.org

저작자표시 비영리 변경금지 (새창열림)

'IT > python' 카테고리의 다른 글

python mysql 데이터베이스 접속 코딩(connection code) (0)	2023.04.29
python 데이터베이스 접속(connection code) 정리 (0)	2023.04.29
merge 설명 : python pandas 함수 (0)	2022.03.27
concat 설명 : python pandas 함수 (0)	2022.03.27
read_csv 설명 : python pandas 함수 (0)	2022.03.23

'IT/python' Related Articles

Comments

All thing of the world!

python Scrapy 초간단 사용법 정리 본문

python Scrapy 초간단 사용법 정리

'IT > python' 카테고리의 다른 글

티스토리툴바