分享web开发知识

注册/登录|最近发布|今日推荐

主页 IT知识网页技术软件开发前端开发代码编程运营维护技术分享教程案例
当前位置:首页 > 代码编程

urllib使用四--urlencode

发布时间:2023-09-06 01:58责任编辑:苏小强关键词:url

urllib.urlencode

把字典数据转换为URL编码

# -*- coding: cp936 -*-import urllibparams = {‘score‘:100,‘name‘:‘爬虫基础‘,‘comment‘:‘very good‘}qs = urllib.urlencode(params)print(qs)

编码后跟在URL后面传递参数:

comment=very+good&score=100&name=%C5%C0%B3%E6%BB%F9%B4%A1

逆向

在urlparse包里

urlparse.parse_qs方法--返回字典

import urllibimport urlparseparams = {‘score‘:100,‘name‘:‘爬虫基础‘,‘comment‘:‘very good‘}qs = urllib.urlencode(params)dic = urlparse.parse_qs(qs)print(dic)

运行结果:

{‘comment‘: [‘very good‘], ‘score‘: [‘100‘], ‘name‘: [‘\xc5\xc0\xb3\xe6\xbb\xf9\xb4\xa1‘]}

对某个url提取参数:

# -*- coding: cp936 -*-import urllibimport urlparseurl = ‘https://www.baidu.com/s?ie=utf-8&f=8&rsv_bp=0&rsv_idx=1&tn=baidu&wd=python%20%E5%8F%91%E9%80%81%E9%82%AE%E4%BB%B6&rsv_pq=8ddee8730003c015&rsv_t=fc02P4%2By%2FPKzaFFtaqGCGLCxPvpVojkM6zg7pgczZB%2FeZAQkXhsuWRPpHqs&rqlang=cn&rsv_enter=1&rsv_sug3=14&rsv_sug1=10&rsv_sug7=101‘result = urlparse.urlparse(url)print(result)

先用urlparse.urlparse(url)方法获取result

ParseResult(scheme=‘https‘, netloc=‘www.baidu.com‘, path=‘/s‘, params=‘‘, query=‘ie=utf-8&f=8&rsv_bp=0&rsv_idx=1&tn=baidu&wd=python%20%E5%8F%91%E9%80%81%E9%82%AE%E4%BB%B6&rsv_pq=8ddee8730003c015&rsv_t=fc02P4%2By%2FPKzaFFtaqGCGLCxPvpVojkM6zg7pgczZB%2FeZAQkXhsuWRPpHqs&rqlang=cn&rsv_enter=1&rsv_sug3=14&rsv_sug1=10&rsv_sug7=101‘, fragment=‘‘)

对result中的query参数中提取

import urllibimport urlparseurl = ‘https://www.baidu.com/s?ie=utf-8&f=8&rsv_bp=0&rsv_idx=1&tn=baidu&wd=python%20%E5%8F%91%E9%80%81%E9%82%AE%E4%BB%B6&rsv_pq=8ddee8730003c015&rsv_t=fc02P4%2By%2FPKzaFFtaqGCGLCxPvpVojkM6zg7pgczZB%2FeZAQkXhsuWRPpHqs&rqlang=cn&rsv_enter=1&rsv_sug3=14&rsv_sug1=10&rsv_sug7=101‘result = urlparse.urlparse(url)dic = urlparse.parse_qs(result.query)print(dic)

运行结果:

{‘wd‘: [‘python \xe5\x8f\x91\xe9\x80\x81\xe9\x82\xae\xe4\xbb\xb6‘], ‘f‘: [‘8‘], ‘rsv_enter‘: [‘1‘], ‘rsv_bp‘: [‘0‘], ‘rsv_t‘: [‘fc02P4+y/PKzaFFtaqGCGLCxPvpVojkM6zg7pgczZB/eZAQkXhsuWRPpHqs‘], ‘rsv_idx‘: [‘1‘], ‘tn‘: [‘baidu‘], ‘rqlang‘: [‘cn‘], ‘rsv_sug7‘: [‘101‘], ‘rsv_pq‘: [‘8ddee8730003c015‘], ‘rsv_sug1‘: [‘10‘], ‘rsv_sug3‘: [‘14‘], ‘ie‘: [‘utf-8‘]}

urllib使用四--urlencode

原文地址:https://www.cnblogs.com/chillytao-suiyuan/p/9147811.html

知识推荐

我的编程学习网——分享web前端后端开发技术知识。 垃圾信息处理邮箱 tousu563@163.com 网站地图
icp备案号 闽ICP备2023006418号-8 不良信息举报平台 互联网安全管理备案 Copyright 2023 www.wodecom.cn All Rights Reserved