Lesson 24 of 30
Data Analysis with Pandas
DataFrames, Series, reading CSV/Excel, filtering, and aggregating data.
Installing Pandas
pip install pandas openpyxl
Creating a DataFrame
import pandas as pd
data = {
"Name": ["Alice", "Bob", "Charlie"],
"Age": [25, 30, 22],
"Score": [88, 72, 95]
}
df = pd.DataFrame(data)
print(df)
Reading Files
df = pd.read_csv("students.csv")
df = pd.read_excel("students.xlsx")
print(df.head()) # first 5 rows
print(df.info()) # dtypes and nulls
print(df.describe()) # statistics
Filtering and Selecting
# Select column
print(df["Name"])
# Filter rows
high = df[df["Score"] > 80]
# Multiple conditions
result = df[(df["Age"] < 28) & (df["Score"] > 85)]
Aggregation
print(df["Score"].mean())
print(df.groupby("Grade")["Score"].agg(["mean", "max"]))
df.to_csv("output.csv", index=False)