
import glob
import os
import random

import streamlit as st

def scoring_section():
    # Title
    st.markdown("## Scoring")

    # Intro text
    st.write(
        "We weight performance across all three challenges, placing additional emphasis on the Evaluation Challenge. "
        "Each team's final rank is determined by the total points they accumulate from Compression, Sampling, and Evaluation."
    )
    
    # Points breakdown, rendered as three columns (one per challenge)
    st.markdown("### Points Breakdown")
    col1, col2, col3 = st.columns(3)
    
    with col1:
        st.markdown('<h3 style="margin-left:15px;">Compression</h3>', unsafe_allow_html=True)
        st.markdown(
            """
            - **1st Place**: 10 points  
            - **2nd Place**: 7 points  
            - **3rd Place**: 5 points
            """
        )
        
    with col2:
        st.markdown('<h3 style="margin-left:15px;">Sampling</h3>', unsafe_allow_html=True)
        st.markdown(
            """
            - **1st Place**: 10 points  
            - **2nd Place**: 7 points  
            - **3rd Place**: 5 points
            """
        )
        
    with col3:
        st.markdown('<h3 style="margin-left:15px;">Evaluation</h3>', unsafe_allow_html=True)
        st.markdown(
            """
            - **1st Place**: 20 points  
            - **2nd Place**: 14 points  
            - **3rd Place**: 10 points
            """
        )
    # Tie-Breakers in an expander for a cleaner layout
    with st.expander("Tie-Breakers"):
        st.write(
            "The overall winner will be the team with the highest total points. "
            "In the event of a tie, the following tie-breakers will be applied in order:\n\n"
            "1. Highest Evaluation Challenge score\n"
            "2. Highest Sampling Challenge score\n"
            "3. Highest Compression Challenge score\n\n"
        )

    # Overall Leaderboard Section
    st.write(
        "The overall leaderboard, which shows total points across all challenges, will go live on **March 10th**. "
        "In addition, each challenge (**Compression**, **Sampling**, and **Evaluation**) will have its own leaderboard on its "
        "respective Hugging Face submission server."
    )
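

# Illustrative sketch only, not the official scoring code.
# The points table and tie-breakers rendered by scoring_section() can be
# expressed as a small ranking helper. Assumptions: the names POINTS and
# rank_teams, the input format (team -> challenge -> finishing place), and
# the reading of "highest challenge score" in the tie-breakers as the points
# a team earned in that challenge are all hypothetical; the organizers may
# compare raw challenge metrics instead.
POINTS = {
    "Compression": {1: 10, 2: 7, 3: 5},
    "Sampling": {1: 10, 2: 7, 3: 5},
    "Evaluation": {1: 20, 2: 14, 3: 10},
}


def rank_teams(placements):
    """Return team names ordered by total points, then by the tie-breakers."""

    def sort_key(team):
        # Points earned per challenge; a missing entry or a place outside
        # the top three earns 0 points.
        earned = {
            challenge: table.get(placements[team].get(challenge), 0)
            for challenge, table in POINTS.items()
        }
        # Total first, then Evaluation, Sampling, Compression (tie-break order).
        return (
            sum(earned.values()),
            earned["Evaluation"],
            earned["Sampling"],
            earned["Compression"],
        )

    return sorted(placements, key=sort_key, reverse=True)


# Example with made-up teams: "a" and "b" both total 20 points, and "b" wins
# the tie on Evaluation points.
# rank_teams({"a": {"Compression": 1, "Sampling": 1},
#             "b": {"Evaluation": 1},
#             "c": {"Evaluation": 2}})  # -> ["b", "a", "c"]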



def main():
    st.set_page_config(page_title="1X World Model Challenge")
    
    st.title("World Model Challenge")
    st.markdown("## Welcome")
    st.write(
        "Welcome to the World Model Challenge. This platform hosts three challenges "
        "designed to advance research in world models for robotics: Compression, Sampling, and Evaluation."
    )


    st.markdown("---")

    st.markdown("## Motivation")
    st.write(
        "Real-world robotics faces a fundamental challenge: environments are dynamic and change over time, "
        "making consistent evaluation of robot performance difficult. World models offer a solution by "
        "learning to simulate complex real-world interactions from raw sensor data. We believe these learned simulators will enable "
        "robust evaluation and iterative improvement of robot policies without the constraints of a physical testbed."
    )
    st.image(
        "assets/model_performance_over_time.webp",
        caption="An example T-shirt folding model we trained that degrades in performance over the course of 50 days.",
        use_container_width=True,
    )
    st.markdown("---")

    st.markdown("## The Challenges")

    st.markdown("#### Compression Challenge")
    st.write(
        "In the Compression Challenge, your task is to train a model that compresses our robot logs effectively while preserving the critical details needed to understand and predict future interactions. Success is measured by your model's loss: the lower the loss, the better your model captures the complexities of real-world robot behavior."
    )
    
    st.markdown("#### Sampling Challenge")
    st.write(
        "In the Sampling Challenge, your task is to predict the video frame two seconds into the future, given a short clip of robot interactions. The goal is to produce a coherent and plausible continuation of the video that accurately reflects the dynamics of the scene. Your submission will be judged on how closely it matches the actual frame."
    )
    
    st.markdown("#### Evaluation Challenge")
    st.write(
        "The Evaluation Challenge tackles the ultimate question: Can you predict a robot's performance in the real world without physically deploying it? In this challenge, you will be provided with many different policies for a specific task. Your task is to rank these policies according to their expected real-world performance. This ranking will be compared with the actual ranking of the policies."
    )

    st.markdown("**Note:** Links to the evaluation servers will be released on March 1st.")

    st.markdown("---")


    st.markdown("## Datasets")
    st.write(
        "We provide two datasets to support the 1X World Model Challenge:\n\n"
        "**Raw Data:** The [world_model_raw_data](https://huggingface.co/datasets/1x-technologies/world_model_raw_data) dataset "
        "provides raw sensor data, video logs, and annotated robot state sequences gathered from diverse real-world scenarios. "
        "This dataset is split into 100 shards (each containing a 512x512 MP4 video, a segment index mapping, and state arrays) "
        "and is licensed under CC-BY-NC-SA 4.0.\n\n"
        "**Tokenized Data:** The [world_model_tokenized_data](https://huggingface.co/datasets/1x-technologies/world_model_tokenized_data) dataset "
        "contains the raw video sequences tokenized with the NVIDIA Cosmos Tokenizer. This compact representation of the raw data "
        "is well suited to the Compression Challenge and is released under the Apache 2.0 license.\n\n"
    )

    gif_folder = "assets/v1.0"
    
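    # Collect all GIF paths from the folder and shuffle them so that each
    # page load shows a different selection
    gif_paths = glob.glob(os.path.join(gif_folder, "*.gif"))
    random.shuffle(gif_paths)

    # Display up to 16 GIFs, 4 per row. Stopping at the number of GIFs found
    # avoids calling st.columns(0) when fewer than 16 are available.
    gifs_per_row = 4
    max_gifs = 16
    for i in range(0, min(len(gif_paths), max_gifs), gifs_per_row):
        # Slice out a batch of up to 4 GIFs
        row_gifs = gif_paths[i:i + gifs_per_row]

        # Create one column per GIF in this row
        cols = st.columns(len(row_gifs))

        # Display each GIF in its own column
        for col, gif_path in zip(cols, row_gifs):
            col.image(gif_path, use_container_width=True)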


    st.markdown("---")


    scoring_section()

    def display_faq(question, answer):
        st.markdown(
            f"""
            <div style="
                padding: 12px; 
                margin-bottom: 12px; 
                background-color: #0d1b2a; 
                border-radius: 8px; 
                border: 1px solid #0d1b2a;">
                <p style="font-weight: bold; margin: 0 0 4px 0; color: #ffffff;">{question}</p>
                <p style="margin: 0; color: #ffffff;">{answer}</p>
            </div>
            """,
            unsafe_allow_html=True
        )
    st.markdown("---")

    st.markdown("## Rules")
    st.markdown(
        """
    **General Guidelines:**
    - The use of publicly available datasets and pretrained weights is allowed. The use of private datasets or pretrained weights is prohibited.
    - You may use future actions to condition future frame predictions.
    - There is no limit on the inference time for any of the challenges.
    - Naive nearest-neighbor retrieval combined with seeking ahead to the next frames from the training set may yield reasonable performance but is not permitted in solutions.

    **Submissions:**
    - All submissions must be reproducible. Include code, configuration files, and any necessary instructions to replicate your results.
    - The leaderboard will display results on a public test set; however, the final winner will be determined based on performance on a private test set.

    **Eligibility:**
    - Prizes cannot be awarded to individuals in U.S.-sanctioned countries. We reserve the right to withhold prizes if a submission violates the spirit of the challenge.
        """,
        unsafe_allow_html=True
    )

    st.markdown("**Note:** Each challenge has additional rules, which will be released when the challenges officially launch on March 1st.")
    st.markdown("---")


    st.markdown("## Already Started Working on These Challenges?")
    st.write(
        "If you've already begun work on any of the challenges, we encourage you to share your progress with the community. "
        "Connect with other participants via our Discord channel or GitHub repository to exchange ideas, get feedback, and collaborate."
    )

    st.markdown("---")

    st.markdown("## FAQs")

    display_faq("Do I have to participate in all challenges?", 
                "No, you may choose to participate in one or more challenges. However, participating in multiple challenges may improve your overall ranking.")

    display_faq("Can I work in a team?", 
                "Yes, team submissions are welcome.")

    display_faq("What are the submission deadlines?", 
                "Deadlines for each challenge will be announced soon.")
    
    st.markdown("---")

    st.markdown("## Additional Data & Research Requests")
    st.markdown(
        """
    Beyond the World Model Challenge, we also want to make the challenges and datasets more useful for your research questions. 
    Want more data on robots interacting with humans? More safety-critical tasks like carrying cups of hot coffee without spilling? 
    More dexterous tool use? Robots working with other robots? Robots dressing themselves in the mirror? 
    Think of 1X as the operations team for getting you high-quality humanoid data in extremely diverse scenarios.

    Email [[email protected]](mailto:[email protected]) with your requests (and why you think the data is important) and we will try to include it in a future data release. 
    You can also discuss your data questions with the community on Discord.
        """
    )

if __name__ == '__main__':
    main()