Spaces:
Running
Running
Sathwikchowdary
commited on
Commit
•
37bac75
1
Parent(s):
e0d020a
Update pages/excel_Files.py
Browse files- pages/excel_Files.py +21 -5
pages/excel_Files.py
CHANGED
@@ -28,27 +28,27 @@ custom_css = """
|
|
28 |
|
29 |
# Inject the CSS into the app
|
30 |
st.markdown(custom_css, unsafe_allow_html=True)
|
31 |
-
st.markdown("<h2 style='text-align:
|
32 |
st.markdown(
|
33 |
"<p style='font-size: 16px; color: White; font-style: italic;'>"
|
34 |
"Excel is a popular tool for data management and analysis, commonly used to organize datasets that can then be used for machine learning tasks. For machine learning, Excel often serves as an initial data source before the data is preprocessed and fed into ML algorithms for tasks such as classification, regression, clustering, etc."
|
35 |
"</p>",
|
36 |
unsafe_allow_html=True
|
37 |
)
|
38 |
-
st.markdown("<h2 style='text-align:
|
39 |
st.markdown(
|
40 |
"<p style='font-size: 16px; color: White; font-style: italic;'>"
|
41 |
"Excel files can be read in various programming languages like Python (pandas), Matlab, R, and Microsoft Excel application."
|
42 |
"</p>",
|
43 |
unsafe_allow_html=True
|
44 |
)
|
45 |
-
st.markdown("<h2 style='text-align:
|
46 |
st.markdown("1. Corrupted Files")
|
47 |
st.markdown("2. Inconsistent Data Types")
|
48 |
st.markdown("3. Missing or Null Values")
|
49 |
st.markdown("4. Large File Size")
|
50 |
|
51 |
-
st.markdown("<h2 style='text-align:
|
52 |
st.markdown("<h3 style='text-align: left; color: Black;'>📈Corrupted Files</h3>", unsafe_allow_html=True)
|
53 |
st.markdown(
|
54 |
"<p style='font-size: 16px; color: White; font-style: italic;'>"
|
@@ -56,10 +56,26 @@ st.markdown(
|
|
56 |
"</p>",
|
57 |
unsafe_allow_html=True
|
58 |
)
|
59 |
-
st.markdown("<h3 style='text-align: left; color: Black;'>📈Inconsistent Data
|
60 |
st.markdown(
|
61 |
"<p style='font-size: 16px; color: White; font-style: italic;'>"
|
62 |
"Preprocess the data by explicitly converting columns to the correct data type using pandas (e.g., pd.to_numeric() for numeric data). Use Excel’s Data Validation feature to ensure consistent data entry."
|
63 |
"</p>",
|
64 |
unsafe_allow_html=True
|
65 |
)
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
28 |
|
29 |
# Inject the CSS into the app
|
30 |
st.markdown(custom_css, unsafe_allow_html=True)
|
31 |
+
st.markdown("<h2 style='text-align: left; color: Black;'>📈What is Excel</h2>", unsafe_allow_html=True)
|
32 |
st.markdown(
|
33 |
"<p style='font-size: 16px; color: White; font-style: italic;'>"
|
34 |
"Excel is a popular tool for data management and analysis, commonly used to organize datasets that can then be used for machine learning tasks. For machine learning, Excel often serves as an initial data source before the data is preprocessed and fed into ML algorithms for tasks such as classification, regression, clustering, etc."
|
35 |
"</p>",
|
36 |
unsafe_allow_html=True
|
37 |
)
|
38 |
+
st.markdown("<h2 style='text-align: left; color: Black;'>📈How to read Excel files</h2>", unsafe_allow_html=True)
|
39 |
st.markdown(
|
40 |
"<p style='font-size: 16px; color: White; font-style: italic;'>"
|
41 |
"Excel files can be read in various programming languages like Python (pandas), Matlab, R, and Microsoft Excel application."
|
42 |
"</p>",
|
43 |
unsafe_allow_html=True
|
44 |
)
|
45 |
+
st.markdown("<h2 style='text-align: left; color: Black;'>📈Issues encountered when handling these files</h2>", unsafe_allow_html=True)
|
46 |
st.markdown("1. Corrupted Files")
|
47 |
st.markdown("2. Inconsistent Data Types")
|
48 |
st.markdown("3. Missing or Null Values")
|
49 |
st.markdown("4. Large File Size")
|
50 |
|
51 |
+
st.markdown("<h2 style='text-align: left; color: Black;'>📈How to overcome these errors/issues.</h2>", unsafe_allow_html=True)
|
52 |
st.markdown("<h3 style='text-align: left; color: Black;'>📈Corrupted Files</h3>", unsafe_allow_html=True)
|
53 |
st.markdown(
|
54 |
"<p style='font-size: 16px; color: White; font-style: italic;'>"
|
|
|
56 |
"</p>",
|
57 |
unsafe_allow_html=True
|
58 |
)
|
59 |
+
st.markdown("<h3 style='text-align: left; color: Black;'>📈Inconsistent Data </h3>", unsafe_allow_html=True)
|
60 |
st.markdown(
|
61 |
"<p style='font-size: 16px; color: White; font-style: italic;'>"
|
62 |
"Preprocess the data by explicitly converting columns to the correct data type using pandas (e.g., pd.to_numeric() for numeric data). Use Excel’s Data Validation feature to ensure consistent data entry."
|
63 |
"</p>",
|
64 |
unsafe_allow_html=True
|
65 |
)
|
66 |
+
st.markdown("<h3 style='text-align: left; color: Black;'>📈Null Values</h3>", unsafe_allow_html=True)
|
67 |
+
st.markdown(
|
68 |
+
"<p style='font-size: 16px; color: White; font-style: italic;'>"
|
69 |
+
"Many Excel files contain missing or blank cells, which can interfere with analysis or cause errors during processing."
|
70 |
+
"</p>",
|
71 |
+
unsafe_allow_html=True
|
72 |
+
)
|
73 |
+
st.markdown("<h3 style='text-align: left; color: Black;'>📈Large File Size</h3>", unsafe_allow_html=True)
|
74 |
+
st.markdown(
|
75 |
+
"<p style='font-size: 16px; color: White; font-style: italic;'>"
|
76 |
+
"Large Excel files with many rows and columns can cause performance issues, especially when reading them into memory with tools like Python."
|
77 |
+
"</p>",
|
78 |
+
unsafe_allow_html=True
|
79 |
+
)
|
80 |
+
|
81 |
+
|