Page 215 - Touhpad Ai
P. 215
Program 23: To convert a column in a DataFrame to data and number format
import pandas as pd
# Sample data with incorrect formats for date and number
data = {
'Order_ID': [101, 102, 103, 104],
'Order_Date': ['2025/01/12', '2025/2/15', 'March 5, 2025', '2025.04.10'],
'Total_Amount': ['1000', '1500.75', '2000', 'Not Available']
}
# Create DataFrame
df = pd.DataFrame(data)
# Show the original DataFrame
print("Original DataFrame:")
print(df)
# Convert 'Order_Date' column to datetime
df['Order_Date'] = pd.to_datetime(df['Order_Date'], errors='coerce')
# Convert 'Total_Amount' column to numeric
df['Total_Amount'] = pd.to_numeric(df['Total_Amount'], errors='coerce')
# Show the updated DataFrame
print("\nUpdated DataFrame:")
print(df)
Output:
Original DataFrame:
Order_ID Order_Date Total_Amount
0 101 2025/01/12 1000
1 102 2025/2/15 1500.75
2 103 March 5, 2025 2000
3 104 2025.04.10 Not Available
Updated DataFrame:
Order_ID Order_Date Total_Amount
0 101 2025-01-12 1000.00
1 102 2025-02-15 1500.75
2 103 NaT 2000.00
3 104 NaT NaN
Trimming Extra Spaces
Extra spaces in text can cause mismatches.
df['Name'] = df['Name'].str.strip()
Theoretical and Practical Aspects of Data Processing 213

