{"id":689,"date":"2018-02-25T23:15:35","date_gmt":"2018-02-25T14:15:35","guid":{"rendered":"https:\/\/t-office.blue\/?p=689"},"modified":"2018-02-25T23:16:10","modified_gmt":"2018-02-25T14:16:10","slug":"post-689","status":"publish","type":"post","link":"https:\/\/wp.t-office.blue\/?p=689","title":{"rendered":"python Web\u30b9\u30af\u30ec\u30a4\u30d4\u30f3\u30b0 \u5b9f\u8df5\u5165\u9580\u3092\u3057\u3088\u3046\u3068\u601d\u3063\u305f\u3089\u3046\u307e\u304f\u3067\u304d\u306a\u304b\u3063\u305f\u8a71"},"content":{"rendered":"<p>\u3053\u3061\u3089\u306e<a href=\"https:\/\/qiita.com\/Azunyan1111\/items\/9b3d16428d2bcc7c9406\">Python Web\u30b9\u30af\u30ec\u30a4\u30d4\u30f3\u30b0 \u5b9f\u8df5\u5165\u9580<\/a>\u3092\u52c9\u5f37\u3057\u3088\u3046\u3068\u8a66\u3057\u3066\u3044\u305f\u3068\u3053\u308d\u3001\u81ea\u5206\u306e\u74b0\u5883\u306b\u306furllib2\u3068\u3044\u3046\u30d1\u30c3\u30b1\u30fc\u30b8\u304c\u30a4\u30f3\u30b9\u30c8\u30fc\u30eb\u3067\u304d\u306a\u304f\u3066urllib3\u3068\u3044\u3046\u30d1\u30c3\u30b1\u30fc\u30b8\u3092\u4f7f\u308f\u306a\u3044\u3068\u3044\u3051\u306a\u3044\u3063\u307d\u3044\u3002\u305d\u3057\u3066\u3001\u30bb\u30ad\u30e5\u30ea\u30c6\u30a3\u30fc\u306b\u95a2\u3059\u308b\u30a8\u30e9\u30fc\u3082\u3067\u308b\u3068\u3044\u3046\u306e\u3067\u3001\u5bfe\u7b56\u3092\u3057\u307e\u3057\u305f\u3002<\/p>\n<p><script async src=\"\/\/pagead2.googlesyndication.com\/pagead\/js\/adsbygoogle.js\"><\/script><br \/>\n<ins class=\"adsbygoogle\"\n     style=\"display:block\"\n     data-ad-format=\"fluid\"\n     data-ad-layout-key=\"-ei+6a+g-c3+k8\"\n     data-ad-client=\"ca-pub-4003048870046052\"\n     data-ad-slot=\"1589408991\"><\/ins><\/p>\n<p><script>\n     (adsbygoogle = window.adsbygoogle || []).push({});\n<\/script><\/p>\n<p>\u00a0<\/p>\n<h2>urllib3\u3092\u4f7f\u3046\u305f\u3081\u306b\u306f<\/h2>\n<p>\u53c2\u8003\u307e\u3067\u306burllib2\u3092\u30a4\u30f3\u30b9\u30c8\u2212\u30eb\u3057\u3088\u3046\u3068\u3059\u308b\u3068\u4ee5\u4e0b\u306e\u3088\u3046\u306a\u30a8\u30e9\u30fc\u304c\u3067\u307e\u3059\u3002<\/p>\n<pre>$ pip install urllib2\nCollecting urllib2\n Could not find a version that satisfies the requirement urllib2 (from versions: )\nNo matching distribution found for urllib2<\/pre>\n<p>\u3061\u306a\u307f\u306b\u30a4\u30f3\u30b9\u30c8\u30fc\u30eb\u3055\u308c\u3066\u3044\u308b\u30e9\u30a4\u30d6\u30e9\u30ea\u3092\u78ba\u8a8d\u3057\u3066\u307f\u308b\u3068<\/p>\n<pre>$ pip freeze\n\u30fb\u30fb\u30fb\nunicodecsv==0.14.1\nurllib3==1.22\n# coding: UTF-8\nwcwidth==0.1.7\n\u30fb\u30fb\u30fb<\/pre>\n<p>urllib3\u304c\u5165\u3063\u3066\u307e\u3059\u3002\u3068\u3044\u3046\u3053\u3068\u3067\u3001\u7279\u306b\u4f55\u3082\u305b\u305a\u4f7f\u3046\u3053\u3068\u306f\u3067\u304d\u307e\u3059\u3002<br \/>\n\u300c\u4f7f\u3046\u3053\u3068\u306f\u3001\u3001\u3001\u300d\u7b11\u3044<\/p>\n<p>\u00a0<\/p>\n<h2>\u30e9\u30a4\u30d6\u30e9\u30ea\u30a4\u30f3\u30b9\u30c8\u30fc\u30eb<\/h2>\n<pre><span class=\"nv\">$ <\/span>pip <span class=\"nb\">install <\/span>beautifulsoup4<\/pre>\n<p>\u3053\u308c\u306f\u305d\u306e\u307e\u307e\u3044\u3051\u308b\u3068\u601d\u3044\u307e\u3059\u3002<\/p>\n<h2>\u65e5\u7d4c\u65b0\u805e\u306e\u30da\u30fc\u30b8\u30bf\u30a4\u30c8\u30eb\u53d6\u5f97\u7528\u30bd\u30fc\u30b9\u30b3\u30fc\u30c9<\/h2>\n<div class=\"code-lang\"><span class=\"bold\">getNikkeiWebPageTitle.py<\/span><\/div>\n<div class=\"highlight\">\n<pre><span class=\"c\"># coding: UTF-8<\/span>\n<span class=\"kn\">import<\/span> <span class=\"nn\">urllib2<\/span>\n<span class=\"kn\">from<\/span> <span class=\"nn\">bs4<\/span> <span class=\"kn\">import<\/span> <span class=\"n\">BeautifulSoup<\/span>\n\n<span class=\"c\"># \u30a2\u30af\u30bb\u30b9\u3059\u308bURL<\/span>\n<span class=\"n\">url<\/span> <span class=\"o\">=<\/span> <span class=\"s\">\"http:\/\/www.nikkei.com\/\"<\/span>\n\n<span class=\"c\"># URL\u306b\u30a2\u30af\u30bb\u30b9\u3059\u308b html\u304c\u5e30\u3063\u3066\u304f\u308b \u2192 <html><head><title>\u7d4c\u6e08\u3001\u682a\u4fa1\u3001\u30d3\u30b8\u30cd\u30b9\u3001\u653f\u6cbb\u306e\u30cb\u30e5\u30fc\u30b9:\u65e5\u7d4c\u96fb\u5b50\u7248<\/title><\/head><body....<\/span>\n<span class=\"n\">html<\/span> <span class=\"o\">=<\/span> <span class=\"n\">urllib2<\/span><span class=\"o\">.<\/span><span class=\"n\">urlopen<\/span><span class=\"p\">(<\/span><span class=\"n\">url<\/span><span class=\"p\">)<\/span>\n\n<span class=\"c\"># html\u3092BeautifulSoup\u3067\u6271\u3046<\/span>\n<span class=\"n\">soup<\/span> <span class=\"o\">=<\/span> <span class=\"n\">BeautifulSoup<\/span><span class=\"p\">(<\/span><span class=\"n\">html<\/span><span class=\"p\">,<\/span> <span class=\"s\">\"html.parser\"<\/span><span class=\"p\">)<\/span>\n\n<span class=\"c\"># \u30bf\u30a4\u30c8\u30eb\u8981\u7d20\u3092\u53d6\u5f97\u3059\u308b \u2192 <title>\u7d4c\u6e08\u3001\u682a\u4fa1\u3001\u30d3\u30b8\u30cd\u30b9\u3001\u653f\u6cbb\u306e\u30cb\u30e5\u30fc\u30b9:\u65e5\u7d4c\u96fb\u5b50\u7248<\/title><\/span>\n<span class=\"n\">title_tag<\/span> <span class=\"o\">=<\/span> <span class=\"n\">soup<\/span><span class=\"o\">.<\/span><span class=\"n\">title<\/span>\n\n<span class=\"c\"># \u8981\u7d20\u306e\u6587\u5b57\u5217\u3092\u53d6\u5f97\u3059\u308b \u2192 \u7d4c\u6e08\u3001\u682a\u4fa1\u3001\u30d3\u30b8\u30cd\u30b9\u3001\u653f\u6cbb\u306e\u30cb\u30e5\u30fc\u30b9:\u65e5\u7d4c\u96fb\u5b50\u7248<\/span>\n<span class=\"n\">title<\/span> <span class=\"o\">=<\/span> <span class=\"n\">title_tag<\/span><span class=\"o\">.<\/span><span class=\"n\">string<\/span>\n\n<span class=\"c\"># \u30bf\u30a4\u30c8\u30eb\u8981\u7d20\u3092\u51fa\u529b<\/span>\n<span class=\"k\">print<\/span> <span class=\"n\">title_tag<\/span>\n\n<span class=\"c\"># \u30bf\u30a4\u30c8\u30eb\u3092\u6587\u5b57\u5217\u3092\u51fa\u529b<\/span>\n<span class=\"k\">print<\/span> <span class=\"n\">title<\/span><\/pre>\n<\/div>\n<p>\u3082\u3061\u308d\u3093\u3046\u307e\u304f\u884c\u304d\u307e\u305b\u3093\u3002<\/p>\n<pre>$ python3 getNikkeiWebPageTitle1.py\n File \"getNikkeiWebPageTitle1.py\", line 22\n print title_tag\n ^\nSyntaxError: Missing parentheses in call to 'print'. Did you mean print(title_tag)?<\/pre>\n<p>\u307e\u305aprint \u306eSyntaxError\u304c\u51fa\u3066\u3044\u308b\u306e\u3067\u4fee\u6b63<\/p>\n<pre># \u30bf\u30a4\u30c8\u30eb\u8981\u7d20\u3092\u51fa\u529b\n#print title_tag\nprint(title_tag)\n\n# \u30bf\u30a4\u30c8\u30eb\u3092\u6587\u5b57\u5217\u3092\u51fa\u529b\n# print title\nprint(title)<\/pre>\n<p>\u5b9f\u884c\u3057\u3066\u307f\u308b\u3068\u307e\u305f\u307e\u305f\u3001\u9055\u3046\u30a8\u30e9\u30fc\u304c\u3067\u307e\u3059\u3002<\/p>\n<pre>$ python3 getNikkeiWebPageTitle1.py\nTraceback (most recent call last):\n File \"getNikkeiWebPageTitle1.py\", line 3, in <module>\n import urllib2\nModuleNotFoundError: No module named 'urllib2'<\/pre>\n<p>urllib3\u3092\u4f7f\u3046\u3068\u3044\u3046\u3053\u3068\u306a\u306e\u3067\u305d\u306e\u3042\u305f\u308a\u3092\u4fee\u6b63<\/p>\n<pre>import urllib3\n\nurl = \"http:\/\/www.nikkei.com\/\"\n#html = urllib3.urlopen(url)\nhttp = urllib3.PoolManager()\nresponse = http.request('GET',url)\n\n# html\u3092BeautifulSoup\u3067\u6271\u3046\n#soup = BeautifulSoup(html, \"html.parser\")\nsoup = BeautifulSoup(response.data.decode('UTF-8'),\"lxml\")<\/pre>\n<p>\u30b3\u30e1\u30f3\u30c8\u30a2\u30a6\u30c8\u304c\u3082\u3068\u306e\u30bd\u30fc\u30b9\u3067\u305d\u306e\u4e0b\u304c\u3001\u4fee\u6b63\u3057\u305f\u90e8\u5206\u3067\u3059\u3002<br \/>\n\u3053\u306e\u30b3\u30fc\u30c9\u3092\u5b9f\u884c\u3059\u308b\u3068<\/p>\n<pre>\/Users\/USERNAME\/anaconda3\/lib\/python3.6\/site-packages\/urllib3\/connectionpool.py:858: InsecureRequestWarning: Unverified HTTPS request is being made. Adding certificate verification is strongly advised. See: https:\/\/urllib3.readthedocs.io\/en\/latest\/advanced-usage.html#ssl-warnings\n InsecureRequestWarning)\n<title>\u7d4c\u6e08\u3001\u682a\u4fa1\u3001\u30d3\u30b8\u30cd\u30b9\u3001\u653f\u6cbb\u306e\u30cb\u30e5\u30fc\u30b9:\u65e5\u7d4c\u96fb\u5b50\u7248<\/title>\n\u7d4c\u6e08\u3001\u682a\u4fa1\u3001\u30d3\u30b8\u30cd\u30b9\u3001\u653f\u6cbb\u306e\u30cb\u30e5\u30fc\u30b9:\u65e5\u7d4c\u96fb\u5b50\u7248<\/pre>\n<p>InsecureRequestWarning\u3068\u3044\u3046\u8b66\u544a\u304c\u51fa\u3066\u3044\u307e\u3059\u3002<\/p>\n<h2>InsecureRequestWarning\u3092\u6d88\u3059<\/h2>\n<p><a href=\"https:\/\/urllib3.readthedocs.io\/en\/latest\/user-guide.html#ssl\">\u53c2\u8003<\/a><\/p>\n<pre><span class=\"n\">$ pip<\/span> <span class=\"n\">install<\/span> <span class=\"n\">certifi<\/span><\/pre>\n<pre><span class=\"n\">$ pip<\/span> <span class=\"n\">install<\/span> <span class=\"n\">urllib3<\/span><span class=\"p\">[<\/span><span class=\"n\">secure<\/span><span class=\"p\">]<\/span><\/pre>\n<p>\u3092\u5b9f\u884c<\/p>\n<pre><span class=\"kn\">import<\/span> <span class=\"nn\">certifi\n<\/span><\/pre>\n<pre><span class=\"n\">http<\/span> <span class=\"o\">=<\/span> <span class=\"n\">urllib3<\/span><span class=\"o\">.<\/span><span class=\"n\">PoolManager<\/span><span class=\"p\">(<\/span><span class=\"n\">cert_reqs<\/span><span class=\"o\">=<\/span><span class=\"s1\">'CERT_REQUIRED'<\/span><span class=\"p\">,<\/span><span class=\"n\">ca_certs<\/span><span class=\"o\">=<\/span><span class=\"n\">certifi<\/span><span class=\"o\">.<\/span><span class=\"n\">where<\/span><span class=\"p\">())\n<\/span><\/pre>\n<p>\u3053\u306e\uff12\u884c\u3092\u8ffd\u52a0\u3057\u307e\u3059\u3002\u305d\u3046\u3059\u308b\u3068\u3002\u30a8\u30e9\u30fc\u306a\u304f\u5b9f\u884c\u3067\u304d\u307e\u3059\u3002<\/p>\n<p>\u30b9\u30af\u30ec\u30a4\u30d4\u30f3\u30b0\u3092\u3046\u307e\u304f\u4f7f\u3046\u3068\u304b\u306a\u308a\u4fbf\u5229\u305d\u3046\u3067\u3059\u306d(^^)<\/p>\n<p><script async src=\"\/\/pagead2.googlesyndication.com\/pagead\/js\/adsbygoogle.js\"><\/script><br \/>\n<ins class=\"adsbygoogle\"\n     style=\"display:block\"\n     data-ad-format=\"fluid\"\n     data-ad-layout-key=\"-ei+6a+g-c3+k8\"\n     data-ad-client=\"ca-pub-4003048870046052\"\n     data-ad-slot=\"1589408991\"><\/ins><\/p>\n<p><script>\n     (adsbygoogle = window.adsbygoogle || []).push({});\n<\/script><\/p>\n","protected":false},"excerpt":{"rendered":"<p>\u3053\u3061\u3089\u306ePython Web\u30b9\u30af\u30ec\u30a4\u30d4\u30f3\u30b0 \u5b9f\u8df5\u5165\u9580\u3092\u52c9\u5f37\u3057\u3088\u3046\u3068\u8a66\u3057\u3066\u3044\u305f\u3068\u3053\u308d\u3001\u81ea\u5206\u306e\u74b0\u5883\u306b\u306furllib2\u3068\u3044\u3046\u30d1\u30c3\u30b1\u30fc\u30b8\u304c\u30a4\u30f3\u30b9\u30c8\u30fc\u30eb\u3067\u304d\u306a\u304f\u3066urllib3\u3068\u3044\u3046\u30d1\u30c3\u30b1\u30fc\u30b8\u3092\u4f7f\u308f\u306a\u3044\u3068\u3044\u3051\u306a\u3044\u3063\u307d\u3044\u3002\u305d\u3057\u3066\u3001\u30bb [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[33,5],"tags":[69,70,71],"class_list":["post-689","post","type-post","status-publish","format-standard","hentry","category-python","category-5","tag-python","tag-70","tag-71"],"_links":{"self":[{"href":"https:\/\/wp.t-office.blue\/index.php?rest_route=\/wp\/v2\/posts\/689","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/wp.t-office.blue\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/wp.t-office.blue\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/wp.t-office.blue\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/wp.t-office.blue\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=689"}],"version-history":[{"count":1,"href":"https:\/\/wp.t-office.blue\/index.php?rest_route=\/wp\/v2\/posts\/689\/revisions"}],"predecessor-version":[{"id":690,"href":"https:\/\/wp.t-office.blue\/index.php?rest_route=\/wp\/v2\/posts\/689\/revisions\/690"}],"wp:attachment":[{"href":"https:\/\/wp.t-office.blue\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=689"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/wp.t-office.blue\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=689"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/wp.t-office.blue\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=689"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}