Big data processing problem encountered with numpy

When reading a .csv file with more than four million lines of data, numpy throws the following exception:

numpy.core._exceptions.MemoryError: Unable to allocate array with shape (4566386, 23) and data type <U20
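
For context, the size of the failed allocation follows directly from the dtype: a <U20 element reserves 20 unicode code points at 4 bytes each, i.e. 80 bytes per cell, so the whole array needs roughly 7.8 GiB. A quick sanity check (just a sketch; the row and column counts come straight from the error message):

import numpy as np

rows, cols = 4566386, 23
itemsize = np.dtype('<U20').itemsize       # 80 bytes: 20 code points x 4 bytes each
total_bytes = rows * cols * itemsize
print(f'{total_bytes / 1024**3:.1f} GiB')  # ~7.8 GiB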

The following is my source code:

import numpy as np
import matplotlib.pyplot as mp  # not used in this snippet
import sklearn.ensemble as se   # not used in this snippet
import sklearn.metrics as sm    # not used in this snippet

headers = None
data = []
with open('/home/tarena/桌面/i-80.csv', 'r') as f:
    # iterate the file object directly so the raw lines are not
    # all held in memory at once (f.readlines() would do that)
    for i, line in enumerate(f):
        fields = line.rstrip('\n').split(',')[2:]  # drop the first two columns
        if i == 0:
            headers = fields
        else:
            data.append(fields)
headers = np.array(data)  # the traceback points here; np.array(headers) was probably intended
data = np.array(data)
print(headers.shape)
print(data.shape)

The following is the output:

Traceback (most recent call last):
  File "/home/tarena/桌面/read_forest.py", line 13, in <module>
    headers = np.array(data)
numpy.core._exceptions.MemoryError: Unable to allocate array with shape (4566386, 23) and data type <U20

Process finished with exit code 1

So although it errors out, the message does tell us the result we were after: the array would have shape (4566386, 23).

Does anyone have a solution?
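
One direction I am considering (just a sketch, assuming pandas is available and that the columns after the first two are all numeric) is to skip the giant unicode array entirely and parse the values into a compact numeric dtype in chunks:

import numpy as np
import pandas as pd  # assumption: pandas is installed

# Read the file in chunks, keep only the columns after the first two,
# and parse them as float32 (4 bytes per value) instead of <U20 (80 bytes).
reader = pd.read_csv(
    '/home/tarena/桌面/i-80.csv',
    usecols=range(2, 25),  # assumption: 25 columns in total, keeping the last 23
    dtype=np.float32,      # assumption: these columns are all numeric
    chunksize=500_000,
)
data = np.vstack([chunk.to_numpy() for chunk in reader])
print(data.shape)          # expected: (4566386, 23)

At float32 the full array would take about 0.4 GiB instead of ~7.8 GiB, which should fit in memory.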

Source: www.cnblogs.com/bitrees/p/11369327.html