Jack Monas
task revise
e0b70ca
raw
history blame
3.6 kB
import streamlit as st
def main():
st.set_page_config(page_title="World Model Challenge")
st.title("World Model Challenge")
st.markdown("### Welcome")
st.write(
"Welcome to the World Model Challenge server. This platform hosts three interrelated challenges "
"designed to advance research in world models for robotics: Compression, Sampling, and Evaluation."
)
st.markdown("### Motivation")
st.write(
"Real-world robotics faces a fundamental challenge: environments are dynamic and change over time, "
"making consistent evaluation of robot performance difficult. World models offer a solution by "
"learning to simulate complex real-world interactions from raw sensor data. We believe these learned simulators will enable"
"robust evaluation and iterative improvement of robot policies without the constraints of a physical testbed."
)
st.markdown("### The Challenges")
st.markdown("#### Compression Challenge")
st.write(
"The Compression Challenge focuses on minimizing training loss over a diverse robot dataset. "
"By effectively compressing and understanding the data, the challenge aims to measure how well a model "
"can capture the complexities of real-world robot interactions. A lower loss indicates a better grasp "
"of the underlying data."
)
st.markdown("#### Sampling Challenge")
st.write(
"Sampling focuses on generating realistic future outcomes in video sequences by predicting the next"
"frame given a sequence of prior frames. The goal is to produce coherent and plausible continuations"
"of the video, accurately reflecting the dynamics of the scene. "
)
st.markdown("#### Evaluation Challenge")
st.write(
"The Evaluation Challenge tackles the ultimate question: Can you predict a robot's performance in the real world "
"without physically deploying it? In this challenge, participants are tasked with ranking a set of robot policies based solely on simulation data."
"These rankings will be compared to the real-world performance of the policies to determine a winner."
)
st.markdown("#### Compression Challenge")
st.write(
"In the Compression Challenge, your task is to train a model to compress our robots logs effectively while preserving the critical details needed to understand and predict future interactions. Success in this challenge is measured by the loss of your model—the lower the loss, the better your model captures the complexities of real-world robot behavior."
)
st.markdown("#### Sampling Challenge")
st.write(
"In the Sampling Challenge, your task is to predict a future video frame two seconds in the future given a short clip of robot interactions. The goal is to produce a coherent and plausible continuations of the video, which accurately reflects the dynamics of the scene. Your submission will be judged on how closely it matches the actual frame."
)
st.markdown("#### Evaluation Challenge")
st.write(
"The Evaluation Challenge tackles the ultimate question: Can you predict a robot's performance in the real world without physically deploying it? In this challenge, you will be provided with many different policies for a specific task. Your task is to rank these policies according to their expected real-world performance. This ranking will be compared with the actual ranking of the policies."
)
if __name__ == '__main__':
main()