分享web开发知识

注册/登录|最近发布|今日推荐

主页 IT知识网页技术软件开发前端开发代码编程运营维护技术分享教程案例
当前位置:首页 > 教程案例

处理json大文件

发布时间:2023-09-06 02:05责任编辑:顾先生关键词:jsjson
import jsonimport pymysql# 读取review数据,并写入数据库# 导入数据库成功,总共4736897条记录def prem(db): ???cursor = db.cursor() ???cursor.execute("SELECT VERSION()") ???data = cursor.fetchone() ???print("Database version : %s " % data) ?# 结果表明已经连接成功 ???cursor.execute("DROP TABLE IF EXISTS review") ?# 习惯性 ???sql = """CREATE TABLE review ( ????????????review_id ?VARCHAR(100), ????????????user_id ?VARCHAR(100), ????????????business_id VARCHAR(200), ????????????stars INT, ????????????text VARCHAR(10000) NOT NULL, ????????????useful INT, ????????????funny INT, ????????????cool INT)""" ???cursor.execute(sql) ?# 根据需要创建一个表格def reviewdata_insert(db): ???with open(‘E:/data/yelp_data/dataset/review.json‘, encoding=‘utf-8‘) as f: ???????i = 0 ???????while True: ???????????i += 1 ???????????print(u‘正在载入第%s行......‘ % i) ???????????try: ???????????????lines = f.readline() ?# 使用逐行读取的方法 ???????????????review_text = json.loads(lines) ?# 解析每一行数据 ???????????????result = [] ???????????????result.append((review_text[‘review_id‘], review_text[‘user_id‘], ???????????review_text[‘business_id‘],review_text[‘stars‘], review_text[‘text‘], review_text[‘useful‘], ???????????????????????????review_text[‘funny‘], review_text[‘cool‘])) ???????????????print(result) ???????????????inesrt_re = "insert into review(review_id, user_id, business_id, stars, text, useful, ???????????funny, cool) values (%s, %s, %s, %s,%s, %s,%s, %s)" ???????????????cursor = db.cursor() ???????????????cursor.executemany(inesrt_re, result) ???????????????db.commit() ???????????except Exception as e: ???????????????db.rollback() ???????????????print(str(e)) ???????????????breakif __name__ == "__main__": ?# 起到一个初始化或者调用函数的作用 ???db = pymysql.connect("localhost", "root", "password(你的密码)", "数据库名称", charset=‘utf8‘) ???cursor = db.cursor() ???prem(db) ???reviewdata_insert(db) ???cursor.close()

  

处理json大文件

原文地址:https://www.cnblogs.com/yspass/p/9346446.html

知识推荐

我的编程学习网——分享web前端后端开发技术知识。 垃圾信息处理邮箱 tousu563@163.com 网站地图
icp备案号 闽ICP备2023006418号-8 不良信息举报平台 互联网安全管理备案 Copyright 2023 www.wodecom.cn All Rights Reserved