Spaces:

Multichem-PD
/

DFS_Contest_Analyzer

Running

James McCool commited on 23 days ago

Commit

55a782f

1 Parent(s): c2b7029

Add unique and under-5 duplicate counts to working_df in app.py

- Implemented calculations for unique lineups and lineups with 5 or fewer duplicates for each BaseName, enhancing data analysis capabilities within the application.

Files changed (1) hide show

app.py +9 -0

app.py CHANGED Viewed

@@ -221,6 +221,15 @@ with tab2:
                 axis=1
             )
             working_df['dupes'] = working_df.groupby('sorted').transform('size')
             working_df = working_df.reset_index()
             working_df['percentile_finish'] = working_df['index'].rank(pct=True)
             working_df['finish'] = working_df['index']

                 axis=1
             )
             working_df['dupes'] = working_df.groupby('sorted').transform('size')
+            # For uniques - count how many unique lineups (dupes == 1) each BaseName has
+            working_df['uniques'] = working_df.groupby('BaseName').apply(
+                lambda x: (x['dupes'] == 1).sum()
+            ).reindex(working_df['BaseName']).values
+            # For under_5 - count how many lineups with 5 or fewer duplicates each BaseName has
+            working_df['under_5'] = working_df.groupby('BaseName').apply(
+                lambda x: (x['dupes'] <= 5).sum()
+            ).reindex(working_df['BaseName']).values
             working_df = working_df.reset_index()
             working_df['percentile_finish'] = working_df['index'].rank(pct=True)
             working_df['finish'] = working_df['index']