Manipulating Pandas dataframes and pivoting¶

import pandas
from numpy import *

f = pandas.read_csv('fruit_sales_log.csv')
f.head()

Having selected and copied the data in a spreadsheet, for example:

# fix headers
f.columns = [item.strip() for item in f.columns]
f.columns

Index(['Date', 'Seller', 'Item', 'Amount'], dtype='object')

f[['Seller','Amount']]

f.pivot_table(values='Amount',index='Seller',columns='Item',aggfunc=sum,margins=True).fillna(0)

	Date	Seller	Item	Amount
0	07/01/17	Yiqing	watermelons	6.21
1	07/01/17	Sakar	watermelons	12.33
2	07/01/17	Yiqing	apples	18.02
3	07/02/17	Sakar	watermelons	1.95
4	07/04/17	Yiqing	watermelons	14.88

Item	apples	peaches	watermelons	All
Seller
Anthony	34.64	17.90	33.74	86.28
Jonathan	5.11	12.57	55.24	72.92
Katherine	2.93	0.00	13.92	16.85
Megan	12.06	16.69	28.28	57.03
Sakar	18.12	16.33	38.31	72.76
Samuel	6.66	29.46	44.22	80.34
Yiqing	58.43	34.74	21.09	114.26
All	137.95	127.69	234.80	500.44