Spaces:

Multichem-PD
/

DFS_Contest_Analyzer

Running

James McCool commited on Apr 3

Commit

a87b532

1 Parent(s): 1ba31e0

Enhance lineup processing in `load_file.py` to improve data extraction

- Updated the logic for splitting the `Lineup` column into individual player entries, now including an additional column for accurate position mapping.
- Added functionality to remove position indicators from the beginning of each player entry, ensuring cleaner data and preventing misinterpretation of player names.

Files changed (1) hide show

global_func/load_file.py +4 -2

global_func/load_file.py CHANGED Viewed

@@ -26,9 +26,11 @@ def load_file(upload):
             # and not those that might appear within player names
             df['Lineup'] = df['Lineup'].str.replace(r'\b(' + '|'.join(pos_values) + r')\b', r'\1,', regex=True)
-            # Split into individual columns
-            for i in range(0,9):
                 df[i] = df['Lineup'].str.split(',').str[i].str.strip()
             position_dict = dict(zip(df['Player'], df['Pos']))
             ownership_dict = dict(zip(df['Player'], df['Own']))
             entry_list = list(set(df['EntryName']))

             # and not those that might appear within player names
             df['Lineup'] = df['Lineup'].str.replace(r'\b(' + '|'.join(pos_values) + r')\b', r'\1,', regex=True)
+            # Split into individual columns and remove position indicators
+            for i in range(0,10):
                 df[i] = df['Lineup'].str.split(',').str[i].str.strip()
+                # Remove position indicators from the beginning of each entry
+                df[i] = df[i].str.replace(r'^(' + '|'.join(pos_values) + r')\s+', '', regex=True)
             position_dict = dict(zip(df['Player'], df['Pos']))
             ownership_dict = dict(zip(df['Player'], df['Own']))
             entry_list = list(set(df['EntryName']))