Python Pandasの使い方のサンプル

2024年2月18日

環境
Windows 11 Pro 64bit
Python 3.11

pipenv を利用している場合、以下のようにインストールします。
$ pipenv install pandas

プログラムから利用するにはimportが必要です。

import pandas as pd
import numpy as np

df = pd.DataFrame(
    np.random.randn(6, 4),
    index=list('ABCDEF'),
    columns=['col1', 'col2', 'col3', 'col4']
)

print(df)

結果

       col1      col2      col3      col4
A -1.801003  0.567649 -1.500904 -0.239381
B -0.719186 -0.966656  1.726989  0.906241
C  0.242600  2.890311 -0.321324 -0.457311
D -0.999545  0.845195 -0.830634  0.332034
E  0.430159  1.964199  0.162262  0.596390
F  0.145257 -0.418800  0.653118  1.253135

1.オブジェクト生成方法
書式
class pandas.Series(data=None, index=None, dtype=None, name=None, copy=False, fastpath=False)

使用例

>>> pd.Series([54, 21, 94, 74])
0 54
1 21
2 94
3 74
dtype: int64

NumPyの ndarray とは異なり、indexを数値以外にします。

>>> pd.Series([54, 21, 94, 74], index=['A', 'B', 'C', 'D'])
A 54
B 21
C 94
D 74
dtype: int64

dictionaryを渡す形式でも利用できます。

>>> dict = {'A': 54, 'B': 21, 'C': 94, 'D': 74}
>>> pd.Series(dict)
A 54
B 21
C 94
D 74
dtype: int64

Pandasの date_rangeメソッドを利用すると、連続した日付データを生成できます。

>>> date = pd.date_range('20230101', periods=4)
>>> date
DatetimeIndex(['2023-01-01', '2023-01-02', '2023-01-03', '2023-01-04'], dtype='datetime64[ns]', freq='D')
>>>
>>> pd.Series(date)
0 2023-01-01
1 2023-01-02
2 2023-01-03
3 2023-01-04
dtype: datetime64[ns]

Posted by arkgame

関連投稿: