Spaces:

scfive
/

bodybuilding-pose-app

Sleeping

App Files Files Community

Saket Shirsath commited on Nov 1, 2020

Commit

af387c1

1 Parent(s): 6f5a5a2

Update index.md

Browse files

Files changed (1) hide show

index.md +58 -1

index.md CHANGED Viewed

@@ -2,6 +2,7 @@
 <br>
 [Project Proposal](proposal.md)
 ## Abstract:
@@ -18,7 +19,7 @@ One of our milestones for this iteration of our project was the successful pose
 | Squat | Bench | Deadlift |
 | ----- | ----- | -------- |
-|<img src="assets\squat_pose.png" height="500px">|<img src="assets\bench_pose.png" height="500px">|<img src="assets\deadlift_pose.png" height="500px">|
 ## Results for HPE
@@ -31,5 +32,61 @@ Nevertheless, the results of the human pose estimation were very promising. We s
 To solve our issue with accurate exercise classification, we experimented with a brute-force categorical convolutional neural network built with Keras. Using the Bing Web Search API, we scraped 250 images for each of the chosen exercises: barbell bench press, barbell back squat, and barbell deadlift. Then, we manually filtered through those to throw away faulty representations and fix formatting issues. For our final dataset, we were left with 155 bench press images, 196 squat images, and 198 deadlift images. For model validation, we randomly allocated 20% of our dataset. The associated image collections are shown in the Appendix.

 <br>
 [Project Proposal](proposal.md)
+# Midterm Update 1
 ## Abstract:
 | Squat | Bench | Deadlift |
 | ----- | ----- | -------- |
+|<img src="assets\squat_pose.png" height="100px">|<img src="assets\bench_pose.png" height="100px">|<img src="assets\deadlift_pose.png" height="100px">|
 ## Results for HPE
 To solve our issue with accurate exercise classification, we experimented with a brute-force categorical convolutional neural network built with Keras. Using the Bing Web Search API, we scraped 250 images for each of the chosen exercises: barbell bench press, barbell back squat, and barbell deadlift. Then, we manually filtered through those to throw away faulty representations and fix formatting issues. For our final dataset, we were left with 155 bench press images, 196 squat images, and 198 deadlift images. For model validation, we randomly allocated 20% of our dataset. The associated image collections are shown in the Appendix.
+## Results
+We started with a single-layer CNN, altering input normalization resolution to see how it affected training and validation accuracy and loss. These preliminary results indicate decent validation accuracy (~70%) for 25x25 and 50x50 normalizations. The 100x100 and 150x150 normalizations are significantly more accurate at around 80%. These graphs indicate that our model is overfitting our training dataset, as is evident by the nearly 100% training accuracy. This is to be expected due to our extremely small dataset.
+| 25x25 | <img src="assets/cnn1/cnn1_25_acc.png" height="100px"> | <img src="assets/cnn1/cnn1_25_loss.png" height="100px"> |
+| 50x50 | <img src="assets/cnn1/cnn1_50_acc.png" height="100px"> | <img src="assets/cnn1/cnn1_50_loss.png" height="100px"> |
+| 100x100 | <img src="assets/cnn1/cnn1_100_acc.png" height="100px"> | <img src="assets/cnn1/cnn1_100_loss.png" height="100px"> |
+| 150x150 | <img src="assets/cnn1/cnn1_150_acc.png" height="100px"> | <img src="assets/cnn1/cnn1_150_loss.png" height="100px"> |
+The results for the three-layer CNN are worse than the single-layer design. Typically, more layers result in higher validation accuracy, but our dataset is so small and our model is so overfitted that we get the opposite effect, resulting in significantly less validation accuracy across the board.
+| 25x25 | <img src="assets/cnn2/cnn2_25_acc.png" height="100px"> | <img src="assets/cnn2/cnn2_25_loss.png" height="100px"> |
+| 50x50 | <img src="assets/cnn2/cnn2_50_acc.png" height="100px"> | <img src="assets/cnn2/cnn2_50_loss.png" height="100px"> |
+| 100x100 | <img src="assets/cnn2/cnn2_100_acc.png" height="100px"> | <img src="assets/cnn2/cnn2_100_loss.png" height="100px"> |
+| 150x150 | <img src="assets/cnn2/cnn2_150_acc.png" height="100px"> | <img src="assets/cnn2/cnn2_150_loss.png" height="100px"> |
+We also see a similar trend with our four-layer implementation.
+| 25x25 | <img src="assets/cnn3/cnn3_25_acc.png" height="100px"> | <img src="assets/cnn3/cnn3_25_loss.png" height="100px"> |
+| 50x50 | <img src="assets/cnn3/cnn3_50_acc.png" height="100px"> | <img src="assets/cnn3/cnn3_50_loss.png" height="100px"> |
+| 100x100 | <img src="assets/cnn3/cnn3_100_acc.png" height="100px"> | <img src="assets/cnn3/cnn3_100_loss.png" height="100px"> |
+| 150x150 | <img src="assets/cnn3/cnn3_150_acc.png" height="100px"> | <img src="assets/cnn3/cnn3_150_loss.png" height="100px"> |
+Next Steps
+Our current dataset is severely limiting the performance of our CNN. There are two ways to expand it: (1) data augmentation and (2) just collecting more images. We will experiment with both of these for the final implementation to get our validation accuracy closer to 100%. It may be possible to improve accuracy by running our dataset images with their associated HPE outputs.
+Once we fix these points, the multi-layer designs should drastically outperform their single-layer counterpart. In fact, it might even be possible to reduce our normalized image sizes to improve training speeds. As shown earlier, a 50x50 image may be sufficient, as is evident by its close to 100% training accuracy.
+Now that we are able to successfully classify our exercise image inputs as the correct exercise, we can move forward with our original goal of outputting recommendations to users regarding their form. One approach we can take is using the output of our human pose estimation CNN to measure the location of certain joints compared to others. For example, in a squat, if the elbows are parallel to the wrist, that is an indication of bad form, and we should be able to detect that.
+Another approach is to expand on the CNN we used to classify exercises. We can potentially use it to classify both the exercise and any indications of bad form at the same time. The only potential issue with this is that we will have to generate an even larger dataset of images than before to accommodate exercises of good form and bad form.
+## Appendix
+### Squat HPE
+<img src="assets\squat_wireframe.png" height="500px">
+### Bench HPE
+<img src="assets\bench_wireframe.png" height="500px">
+### Deadlift HPE
+<img src="assets\deadlift_wireframe.png" height="500px">
+<img src="assets\bench_training.png" height="500px">
+<img src="assets\bench_testing.png" height="500px">
+<img src="assets\squat_training.png" height="500px">
+<img src="assets\squat_testing.png" height="500px">
+<img src="assets\deadlift_training.png" height="500px">
+<img src="assets\deadlift_testing.png" height="500px">