BeautifulSoup: the requests response object (req)
HTML source: req.text
Headers: req.headers
Status code: req.status_code
HTTP success check: req.ok
Python-Library/Python-Library__BeautifulSoup 2018.12.24
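The four attributes above can be gathered in one place before the HTML is handed to BeautifulSoup. The following is a minimal sketch, not the author's code: `describe_response` and `fetch_soup` are hypothetical helper names, and the `SimpleNamespace` object is only an offline stand-in for a real response so the snippet runs without network access.

```python
from types import SimpleNamespace


def describe_response(req):
    """Collect the attributes listed in the note into one dict."""
    return {
        "html": req.text,                # body decoded as text
        "headers": dict(req.headers),    # response headers
        "status": req.status_code,       # numeric HTTP status
        "ok": req.ok,                    # True when the status is below 400
    }


def fetch_soup(url):
    """Hypothetical helper: fetch a page and parse it only if req.ok is true."""
    import requests                      # third-party, imported lazily
    from bs4 import BeautifulSoup        # third-party, imported lazily

    req = requests.get(url, timeout=10)
    if not req.ok:
        raise RuntimeError("request failed with status %d" % req.status_code)
    return BeautifulSoup(req.text, "html.parser")


# Offline stand-in for a response object (assumption, for illustration only).
fake = SimpleNamespace(
    text="<html><body>hello</body></html>",
    headers={"Content-Type": "text/html"},
    status_code=200,
    ok=True,
)
summary = describe_response(fake)
print(summary["status"], summary["ok"])
```

With a real URL, `fetch_soup(url)` would return a soup object ready for `select` calls.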
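Selector
test = soup.select('div.tit3 > a[href*=movie]')
Fetches only the a tags that are direct children of a div whose class is tit3 and whose href contains movie. (div.tit3 matches a class; matching an id would be div#tit3.)
Python-Library/Python-Library__BeautifulSoup 2018.12.24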
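To see what each part of that selector filters out, here is a small sketch run against a made-up HTML fragment. The markup, link targets, and variable names are assumptions for illustration; only `soup.select` and the selector itself come from the note.

```python
from bs4 import BeautifulSoup

# Made-up fragment: only link A satisfies every part of the selector.
HTML = """
<div class="tit3"><a href="/movie/1">A</a></div>
<div class="tit3"><a href="/music/1">B</a></div>
<div id="tit3"><a href="/movie/2">C</a></div>
<div class="tit3"><span><a href="/movie/3">D</a></span></div>
"""

soup = BeautifulSoup(HTML, "html.parser")

# class tit3, direct child <a>, href containing "movie"
by_class = [a.get_text() for a in soup.select('div.tit3 > a[href*="movie"]')]

# same rule, but matching id="tit3" instead of the class
by_id = [a.get_text() for a in soup.select('div#tit3 > a[href*="movie"]')]

print(by_class, by_id)
```

B is dropped because its href lacks "movie", C because tit3 is an id there rather than a class, and D because the link is nested in a span and is not a direct child of the div.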
Converting an XML API response to JSON
import json
import requests
import xmltodict

key = 'key value'  # your service key
year = 2018
month = '10'
apiUrl = 'http://apis.data.go.kr/B090041/openapi/service/SpcdeInfoService/getHoliDeInfo?solYear=' + str(year) + '&solMonth=' + month + '&ServiceKey=' + key
req = requests.get(apiUrl)
xpars = xmltodict.parse(req.text)  # XML text -> dict
jsonDump = json.dumps(xpars)       # dict -> JSON string
jsonBody = json.loads(jsonDump)    # JSON string -> plain dict
print(jsonBody['response']['body']['items']['item'])
Python/Python__works 2018.12.21
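One thing to watch with this approach: xmltodict returns a dict when an element occurs once and a list when it repeats, so `['items']['item']` can change type from month to month. The sketch below shows one way to normalise that with xmltodict's `force_list` option. The sample XML and its field names (`dateName`, `locdate`) are made up for illustration, and `holiday_items` is a hypothetical helper name.

```python
import json

import xmltodict

# Made-up response with a single <item>; field names are assumptions.
SAMPLE_XML = """
<response>
  <body>
    <items>
      <item><dateName>sample holiday</dateName><locdate>20181009</locdate></item>
    </items>
  </body>
</response>
"""


def holiday_items(xml_text):
    """Parse the XML and always return the items as a list of dicts."""
    parsed = xmltodict.parse(xml_text, force_list=("item",))
    body = json.loads(json.dumps(parsed))   # same round trip as in the note
    items = body["response"]["body"]["items"]
    return items["item"] if items else []   # an empty <items/> parses to None


items = holiday_items(SAMPLE_XML)
print(items)
```

Without `force_list`, the same single-item document yields a dict at `['item']`, which is why code that loops over the result can break for months with only one entry.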
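Removing Korean characters and special characters with a regular expression
import re

testText = 'asdfasdfㅋㅌㅊㅍㅋㅌㅊㅍ1234234가나다라*@*@#*#@*'
korean = re.compile('[\u3131-\u3163\uac00-\ud7a3]+')
# remove Korean characters
parseText = re.sub(korean, '', testText)
# remove special characters (pass the result of the previous step, not an undefined name)
parseText = re.sub('[-=.#/?:$}]', '', parseText)
Python/Python__works 2018.12.21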
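The special-character class in the note removes only the symbols it lists, so characters such as `*` and `@` in the sample string survive. The sketch below wraps both steps in functions and adds an allow-list variant that keeps only ASCII letters and digits. The function names are hypothetical, and the allow-list is an alternative technique rather than what the note does.

```python
import re

KOREAN = re.compile('[\u3131-\u3163\uac00-\ud7a3]+')   # Hangul jamo and syllables
LISTED_SYMBOLS = re.compile('[-=.#/?:$}]')             # class used in the note
NOT_ASCII_ALNUM = re.compile('[^0-9A-Za-z]+')          # allow-list alternative

SAMPLE = 'asdfasdfㅋㅌㅊㅍㅋㅌㅊㅍ1234234가나다라*@*@#*#@*'


def strip_korean(text):
    """Remove Hangul jamo and syllables."""
    return KOREAN.sub('', text)


def strip_listed_symbols(text):
    """Remove only the symbols listed in the note's character class."""
    return LISTED_SYMBOLS.sub('', text)


def keep_ascii_alnum(text):
    """Drop everything except ASCII letters and digits."""
    return NOT_ASCII_ALNUM.sub('', text)


no_korean = strip_korean(SAMPLE)
note_result = strip_listed_symbols(no_korean)
allow_list_result = keep_ascii_alnum(SAMPLE)
print(no_korean)
print(note_result)
print(allow_list_result)
```

The deny-list suits cases where most punctuation should be preserved; the allow-list is simpler when only letters and digits are wanted.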