Page 215 - Touhpad Ai
P. 215

Program 23: To convert a column in a DataFrame to data and number format

                    import pandas as pd
                    # Sample data with incorrect formats for date and number
                    data = {
                        'Order_ID': [101, 102, 103, 104],
                        'Order_Date': ['2025/01/12', '2025/2/15', 'March 5, 2025', '2025.04.10'],
                        'Total_Amount': ['1000', '1500.75', '2000', 'Not Available']
                    }


                    # Create DataFrame
                    df = pd.DataFrame(data)


                    # Show the original DataFrame
                    print("Original DataFrame:")
                    print(df)


                    # Convert 'Order_Date' column to datetime
                    df['Order_Date'] = pd.to_datetime(df['Order_Date'], errors='coerce')


                    # Convert 'Total_Amount' column to numeric
                    df['Total_Amount'] = pd.to_numeric(df['Total_Amount'], errors='coerce')


                    # Show the updated DataFrame
                    print("\nUpdated DataFrame:")
                    print(df)
                   Output:
                    Original DataFrame:

                           Order_ID       Order_Date      Total_Amount
                    0            101      2025/01/12                1000
                    1            102        2025/2/15           1500.75
                                   2               103  March 5, 2025                 2000
                                   3               104       2025.04.10  Not Available
                    Updated DataFrame:

                           Order_ID       Order_Date      Total_Amount
                    0            101      2025-01-12            1000.00
                    1            102      2025-02-15            1500.75
                    2            103               NaT          2000.00
                    3            104               NaT               NaN
                 Trimming Extra Spaces

                 Extra spaces in text can cause mismatches.
                 df['Name'] = df['Name'].str.strip()





                                                                      Theoretical and Practical Aspects of Data Processing  213
   210   211   212   213   214   215   216   217   218   219   220