{ "cells": [ { "cell_type": "markdown", "metadata": { "id": "DS4E1PagbDgL" }, "source": [ "# Setup" ] }, { "cell_type": "markdown", "metadata": { "id": "tQ1r1bbb0yBv" }, "source": [ "\n", "## Instructions\n", "1. Work on a copy of this notebook: _File_ > _Save a copy in Drive_ (you will need a Google account).\n", "2. (Optional) If you would like to do the deep learning component of this tutorial, turn on the GPU with Edit->Notebook settings->Hardware accelerator->GPU\n", "3. Execute the following cell (click on it and press Ctrl+Enter) to install Julia, IJulia and other packages (if needed, update `JULIA_VERSION` and the other parameters). This takes a couple of minutes.\n", "4. Continue to the next section.\n", "\n", "_Notes_:\n", "* If your Colab Runtime gets reset (e.g., due to inactivity), repeat steps 3, 4.\n", "* After installation, if you want to change the Julia version or activate/deactivate the GPU, you will need to reset the Runtime: _Runtime_ > _Delete and disconnect runtime_ and repeat steps 2-4." ] }, { "cell_type": "markdown", "metadata": { "id": "COndi88gbDgO" }, "source": [ "**Run the following code if you need to install Julia**\n", "\n", "**(select all lines -> Option-/)**" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/" }, "id": "GIeFXS0F0zww", "outputId": "f25272ac-a660-42fd-d739-82778e6d7415" }, "outputs": [], "source": [ "%%shell\n", "set -e\n", "\n", "#---------------------------------------------------#\n", "JULIA_VERSION=\"1.7.2\"\n", "JULIA_NUM_THREADS=4\n", "#---------------------------------------------------#\n", "\n", "if [ -z `which julia` ]; then\n", " # Install Julia\n", " JULIA_VER=`cut -d '.' -f -2 <<< \"$JULIA_VERSION\"`\n", " echo \"Installing Julia $JULIA_VERSION on the current Colab Runtime...\"\n", " BASE_URL=\"https://julialang-s3.julialang.org/bin/linux/x64\"\n", " URL=\"$BASE_URL/$JULIA_VER/julia-$JULIA_VERSION-linux-x86_64.tar.gz\"\n", " wget -nv $URL -O /tmp/julia.tar.gz # -nv means \"not verbose\"\n", " tar -x -f /tmp/julia.tar.gz -C /usr/local --strip-components 1\n", " rm /tmp/julia.tar.gz\n", " julia -e 'println(\"Success\")'\n", "fi" ] }, { "cell_type": "markdown", "metadata": { "id": "ORv1c6xvbDgV" }, "source": [ "Install PySR and PyTorch-Lightning:" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "id": "EhMRSZEYFPLz" }, "outputs": [], "source": [ "!pip install -Uq pysr pytorch_lightning" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "id": "j666aOI8xWF_" }, "outputs": [], "source": [ "# Required to get printing from Julia working in colab \n", "# (you don't need to normally do this)\n", "from julia import Julia\n", "julia = Julia(compiled_modules=False)\n", "from julia import Main\n", "from julia.tools import redirect_output_streams\n", "redirect_output_streams()" ] }, { "cell_type": "markdown", "metadata": { "id": "6u2WhbVhht-G" }, "source": [ "Let's install the backend of PySR, and all required libraries. We will also precompile them so they are faster at startup.\n", "\n", "**This will take ~5 minutes.**" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/" }, "id": "J-0QbxyK1_51", "outputId": "3742d548-3574-4739-80b1-ee9e906e4b57" }, "outputs": [], "source": [ "import pysr\n", "pysr.install()" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "id": "vFpyRxmhFqeH" }, "outputs": [], "source": [ "import sympy\n", "import numpy as np\n", "from matplotlib import pyplot as plt\n", "from pysr import PySRRegressor\n", "import torch\n", "from torch import nn, optim\n", "from torch.nn import functional as F\n", "from torch.utils.data import DataLoader, TensorDataset\n", "import pytorch_lightning as pl\n", "from sklearn.model_selection import train_test_split" ] }, { "cell_type": "markdown", "metadata": { "id": "gsRMQ7grbDga" }, "source": [ "# Simple PySR example:\n" ] }, { "cell_type": "markdown", "metadata": { "id": "myTEwdiUFiGL" }, "source": [ "First, let's learn a simple function\n", "\n", "$$2.5382 \\cos(x3) + x0^2 - 2$$" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "id": "Cb1eb2XuFQh8" }, "outputs": [], "source": [ "# Dataset\n", "np.random.seed(0)\n", "X = 2*np.random.randn(100, 5)\n", "y = 2.5382*np.cos(X[:, 3]) + X[:, 0]**2 - 2" ] }, { "cell_type": "markdown", "metadata": { "id": "cturCkaVjzLs" }, "source": [ "By default, we will set up 30 populations of expressions (which evolve independently except for migrations), use 4 threads, and use `\"best\"` for our model selection strategy:" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "id": "4nDAAnisdhTc" }, "outputs": [], "source": [ "default_pysr_params = dict(\n", " populations=30,\n", " procs=4,\n", " model_selection=\"best\",\n", ")" ] }, { "cell_type": "markdown", "metadata": { "id": "N4gANfkaj8ie" }, "source": [ "PySR can run for arbitrarily long, and continue to find more and more accurate expressions. You can set the total number of cycles of evolution with `niterations`, although there are also a [few more ways](https://github.com/MilesCranmer/PySR/pull/134) to stop execution.\n", "\n", "**This first execution will take a bit longer to startup, as the library is JIT-compiled. The next execution will be much faster.**" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/" }, "id": "p4PSrO-NK1Wa", "outputId": "474b4780-5d94-4795-88b9-225030b17abe", "scrolled": true }, "outputs": [], "source": [ "# Learn equations\n", "model = PySRRegressor(\n", " niterations=30,\n", " binary_operators=[\"plus\", \"mult\"],\n", " unary_operators=[\"cos\", \"exp\", \"sin\"],\n", " **default_pysr_params)\n", "\n", "model.fit(X, y)" ] }, { "cell_type": "markdown", "metadata": { "id": "-bsAECbdkQsQ" }, "source": [ "We can print the model, which will print out all the discovered expressions:" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/" }, "id": "4HR8gknlZz4W", "outputId": "606c26ad-6a7d-42e0-a014-fc84bc898ef7" }, "outputs": [], "source": [ "model" ] }, { "cell_type": "markdown", "metadata": { "id": "ME3ddPxXkWQg" }, "source": [ "We can also view the SymPy format of the best expression:" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/", "height": 38 }, "id": "IQKOohdpztS7", "outputId": "b0538aca-3916-4d2b-f5f2-f394cdba3dad" }, "outputs": [], "source": [ "model.sympy()" ] }, { "cell_type": "markdown", "metadata": { "id": "EHIIPlmClltn" }, "source": [ "We can also view the SymPy of any other expression in the list, using the index of it in `model.equation`." ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/", "height": 38 }, "id": "GRcxq-TTlpRX", "outputId": "5e13599e-d469-4110-94a4-689023b40717" }, "outputs": [], "source": [ "model.sympy(2)" ] }, { "cell_type": "markdown", "metadata": { "id": "YMugcGX4tbqj" }, "source": [ "## Output" ] }, { "cell_type": "markdown", "metadata": { "id": "gIWt5wz5cjXE" }, "source": [ "`model.equations` is a Pandas DataFrame. We can exports the results in various ways:" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/", "height": 35 }, "id": "HFGaNL6tbDgi", "outputId": "260b0db4-862f-4101-c494-8b03756ed126" }, "outputs": [], "source": [ "model.latex()" ] }, { "cell_type": "markdown", "metadata": { "id": "4hS8kqutcmPQ" }, "source": [ "These is also `model.sympy(), model.jax(), model.pytorch()`. All of these can take an index as input, to get the result for an arbitrary equation in the list.\n", "\n", "We can also use `model.predict` for arbitrary equations, with the default equation being the one chosen by `model_selection`:" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/" }, "id": "Vbz4IMsk2NYH", "outputId": "c0eb8aeb-6656-40a2-eeaf-cd733a2593b8" }, "outputs": [], "source": [ "ypredict = model.predict(X)\n", "ypredict_simpler = model.predict(X, 2)\n", "\n", "print(\"Default selection MSE:\", np.power(ypredict - y, 2).mean())\n", "print(\"Manual selection MSE for index 2:\", np.power(ypredict_simpler - y, 2).mean())" ] }, { "cell_type": "markdown", "metadata": { "id": "SQDUScGebDgr" }, "source": [ "# Custom operators" ] }, { "cell_type": "markdown", "metadata": { "id": "qvgVbOoSFtQY" }, "source": [ "A full list of operators is given here: https://pysr.readthedocs.io/en/latest/docs/operators/,\n", "but we can also use any binary or unary operator in `julia`, or define our own as arbitrary functions.\n", "\n", "Say that we want a command to do quartic powers:\n", "\n", "$$ y = x_0^4 - 2 $$" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "id": "JvXOVqSyFsdr" }, "outputs": [], "source": [ "y = X[:, 0]**4 - 2" ] }, { "cell_type": "markdown", "metadata": { "id": "-zoqaL8KGSK5" }, "source": [ "We can do this by passing a string in Julia syntax.\n", "\n", "We also define the operator in sympy, with `extra_sympy_mappings`, to enable its use in `predict`, and other export functions." ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/" }, "id": "PoEkpvYuGUdy", "outputId": "18493db1-67e3-4493-f5e7-19277dd003d9", "scrolled": true }, "outputs": [], "source": [ "model = PySRRegressor(\n", " niterations=5, populations=40,\n", " binary_operators=[\"plus\", \"mult\"],\n", " unary_operators=[\"cos\", \"exp\", \"sin\",\n", " \"quart(x) = x^4\"],\n", " extra_sympy_mappings={\"quart\": lambda x: x**4})\n", "model.fit(X, y) " ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/", "height": 38 }, "id": "emn2IajKbDgy", "outputId": "7bfade39-b95e-4314-8d46-854e2421c496" }, "outputs": [], "source": [ "model.sympy()" ] }, { "cell_type": "markdown", "metadata": { "id": "wbWHyOjl2_kX" }, "source": [ "Since `quart` is arguably more complex than the other operators, you can also give it a different complexity, using, e.g., `complexity_of_operators={\"quart\": 2}` to give it a complexity of 2 (instead of the default 2). You can also define custom complexities for variables and constants (`complexity_of_variables` and `complexity_of_constants`, respectively - both take a single number).\n", "\n", "\n", "One can also add a binary operator, with, e.g., `\"myoperator(x, y) = x^2 * y\"`. All Julia operators that work on scalar 32-bit floating point values are available.\n", "\n", "Make sure that any operator you add is valid over the real line. So, e.g., you will need to define `\"mysqrt(x) = sqrt(abs(x))\"` to enable it for negative numbers,\n", "or, simply have it return a very large number for bad inputs (to prevent negative input in a soft way):\n", "`\"mysqrt(x::T) where {T} = (x >= 0) ? x : T(-1e9)\"` (Julia syntax for a template function of input type `T`), which will make `mysqrt(x)` return -10^9 for negative x–hurting the loss of the equation." ] }, { "cell_type": "markdown", "metadata": { "id": "pEXT4xskbDg0" }, "source": [ "## Scoring" ] }, { "cell_type": "markdown", "metadata": { "id": "IyeYbVVOG60w" }, "source": [ "Using `model_selection=\"best\"`selects the equation with the max score and prints it. But in practice it is best to look through all the equations manually, select an equation above some MSE threshold, and then use the score to select among that loss threshold.\n", "\n", "Here, \"score\" is defined by:\n", "$$ \\text{score} = - \\log(\\text{loss}_i/\\text{loss}_{i-1})/\n", "(\\text{complexity}_i - \\text{complexity}_{i-1})$$" ] }, { "cell_type": "markdown", "metadata": { "id": "I3IxmvSQrhfw" }, "source": [ "This scoring is motivated by the common strategy of looking for drops in the loss-complexity curve.\n", "\n", "From Schmidt & Lipson (2009) -" ] }, { "cell_type": "markdown", "metadata": { "id": "eUeXyoLxrd8o" }, "source": [ "![F4.large.jpg]()" ] }, { "cell_type": "markdown", "metadata": { "id": "gDZyxsA7bDg9" }, "source": [ "# Noise example" ] }, { "cell_type": "markdown", "metadata": { "id": "cJCHdDt6IOou" }, "source": [ "Here is an example with noise. Known Gaussian noise with $\\sigma$ between 0.1 and 5.0. We record samples of $y$:\n", "\n", "$$ \\sigma \\sim U(0.1, 5.0) $$\n", "$$ \\epsilon \\sim \\mathcal{N}(0, \\sigma^2)$$\n", "$$ y = 5\\;\\cos(3.5 x_0) - 1.3 + \\epsilon.$$\n", "We have 5 features, say. The weights change the loss function to be:\n", "$$MSE = \\sum [(y - f(x))^2*w],$$\n", "\n", "so in this example, we can set:\n", "$$w = 1/\\sigma^2.$$" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "id": "up1RvmwyOdal" }, "outputs": [], "source": [ "np.random.seed(0)\n", "N = 3000\n", "upper_sigma = 5\n", "X = 2*np.random.rand(N, 5)\n", "sigma = np.random.rand(N)*(5-0.1) + 0.1\n", "eps = sigma*np.random.randn(N)\n", "y = 5*np.cos(3.5*X[:, 0]) - 1.3 + eps" ] }, { "cell_type": "markdown", "metadata": { "id": "-EJPDZbP5YEZ" }, "source": [ "Let's look at this dataset:" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/", "height": 298 }, "id": "sqMqb4nJ5ZR5", "outputId": "7f06a215-f3b6-4053-fc98-5da227a388a3" }, "outputs": [], "source": [ "plt.scatter(X[:, 0], y, alpha=0.2)\n", "plt.xlabel(\"$x_0$\")\n", "plt.ylabel(\"$y$\")" ] }, { "cell_type": "markdown", "metadata": { "id": "kaddasbBuDDv" }, "source": [ "Define some weights to use:" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "id": "3wqz9_sIbDhA" }, "outputs": [], "source": [ "weights = 1/sigma**2" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/" }, "id": "v8WBYtcZbDhC", "outputId": "de926074-1742-4fa3-fe44-188307e9214c" }, "outputs": [], "source": [ "weights[:5]" ] }, { "cell_type": "markdown", "metadata": { "id": "NXWdQSCFuAzV" }, "source": [ "Let's run PySR again:" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/" }, "id": "a07K3KUjOxcp", "outputId": "629898f5-36aa-4616-bdb7-b41e07619a02", "scrolled": true }, "outputs": [], "source": [ "model = PySRRegressor(\n", " loss='myloss(x, y, w) = w * abs(x - y)', # Custom loss function with weights.\n", " niterations=20,\n", " populations=20, # Use more populations\n", " binary_operators=[\"plus\", \"mult\"],\n", " unary_operators=[\"cos\"]\n", ")\n", "model.fit(X, y, weights=weights)" ] }, { "cell_type": "markdown", "metadata": { "id": "CHCMO9CouFLP" }, "source": [ "Let's see if we get similar results to the true equation" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/" }, "id": "oHyUbcg6ggmx", "outputId": "a261b520-5a89-42c9-982d-b05b905885fb" }, "outputs": [], "source": [ "model" ] }, { "cell_type": "markdown", "metadata": { "id": "OchlZZQP8Ums" }, "source": [ "We can also filter all equations up to 2x the most accurate equation, then select the best score from that list:" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/", "height": 38 }, "id": "PB67POLr8b_L", "outputId": "69e4d94b-a00a-4e59-a781-38571b5f42e0" }, "outputs": [], "source": [ "best_idx = model.equations.query(f\"loss < {2 * model.equations.loss.min()}\").score.idxmax()\n", "model.sympy(best_idx)" ] }, { "cell_type": "markdown", "metadata": { "id": "SRHTP4x55roh" }, "source": [ "We can also use `denoise=True`, which will run the input through a Gaussian process to denoise the dataset, before fitting on it." ] }, { "cell_type": "markdown", "metadata": { "id": "eTGQ4NA78yAw" }, "source": [ "Let's look at the fit:" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/", "height": 265 }, "id": "ezCC0IkS8zFf", "outputId": "d928f975-3843-4430-93fa-4ae3a18abb51" }, "outputs": [], "source": [ "plt.scatter(X[:, 0], y, alpha=0.1)\n", "plt.scatter(X[:, 0], model.predict(X, index=best_idx));" ] }, { "cell_type": "markdown", "metadata": { "id": "cPc1EDvRbDhL" }, "source": [ "# High-dimensional input: Neural Nets + Symbolic Regression" ] }, { "cell_type": "markdown", "metadata": { "id": "3hS2kTAbbDhL" }, "source": [ "In this example, let's learn a high-dimensional problem. **This will use the method proposed in our NeurIPS paper: https://arxiv.org/abs/2006.11287.**\n", "\n", "Let's consider a time series problem:\n", "\n", "$$ z = y^2,\\quad y = \\frac{1}{100} \\sum(y_i),\\quad y_i = x_{i0}^2 + 6 \\cos(2*x_{i2})$$\n", "\n", "Imagine our time series is 100 timesteps. That is very hard for symbolic regression, even if we impose the inductive bias of $$z=f(\\sum g(x_i))$$ - it is the square of the number of possible equations!\n", "\n", "But, as in our paper, **we can break this problem down into parts with a neural network. Then approximate the neural network with the symbolic regression!**\n", "\n", "Then, instead of, say, $(10^9)^2=10^{18}$ equations, we only have to consider $2\\times 10^9$ equations." ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/" }, "id": "SXJGXySlbDhL", "outputId": "ea1af43c-3823-4779-f981-f8ccc431d055" }, "outputs": [], "source": [ "###### np.random.seed(0)\n", "N = 100000\n", "Nt = 100\n", "X = 6 * np.random.rand(N, Nt, 5) - 3\n", "y_i = X[..., 0]**2 + 6*np.cos(2*X[..., 2])\n", "y = np.sum(y_i, axis=1)/y_i.shape[1]\n", "z = y**2\n", "X.shape, y.shape" ] }, { "cell_type": "markdown", "metadata": { "id": "8ZqGupq_uSgp" }, "source": [ "## Neural Network definition" ] }, { "cell_type": "markdown", "metadata": { "id": "r2NR0h8-bDhN" }, "source": [ "So, as described above, let's first use a neural network with the sum inductive bias to solve this problem.\n", "\n", "Essentially, we will learn two neural networks:\n", "- `f`\n", "- `g`\n", "\n", "each defined as a multi-layer perceptron. We will sum over `g` the same way as in our equation, but we won't define the summed part beforehand.\n", "\n", "Then, we will fit `g` and `f` **separately** using symbolic regression." ] }, { "cell_type": "code", "execution_count": null, "metadata": { "id": "nWVfkV_YbDhO" }, "outputs": [], "source": [ "hidden = 128\n", "total_steps = 50000\n", "\n", "def mlp(size_in, size_out, act=nn.ReLU):\n", " return nn.Sequential(\n", " nn.Linear(size_in, hidden),\n", " act(),\n", " nn.Linear(hidden, hidden),\n", " act(),\n", " nn.Linear(hidden, hidden),\n", " act(),\n", " nn.Linear(hidden, size_out))\n", "\n", "\n", "class SumNet(pl.LightningModule):\n", " def __init__(self):\n", " super().__init__()\n", " \n", " ########################################################\n", " # The same inductive bias as above!\n", " self.g = mlp(5, 1)\n", " self.f = mlp(1, 1)\n", "\n", " def forward(self, x):\n", " y_i = self.g(x)[:, :, 0]\n", " y = torch.sum(y_i, dim=1, keepdim=True)/y_i.shape[1]\n", " z = self.f(y)\n", " return z[:, 0]\n", " ########################################################\n", " \n", " # PyTorch Lightning bookkeeping:\n", " def training_step(self, batch, batch_idx):\n", " x, z = batch\n", " predicted_z = self(x)\n", " loss = F.mse_loss(predicted_z, z)\n", " return loss\n", "\n", " def validation_step(self, batch, batch_idx):\n", " return self.training_step(batch, batch_idx)\n", "\n", " def configure_optimizers(self):\n", " self.trainer.reset_train_dataloader()\n", " # self.train_dataloader.loaders # access it here.\n", "\n", " optimizer = torch.optim.Adam(self.parameters(), lr=self.max_lr)\n", " scheduler = {'scheduler': torch.optim.lr_scheduler.OneCycleLR(optimizer, max_lr=self.max_lr,\n", " total_steps=self.total_steps,\n", " final_div_factor=1e4),\n", " 'interval': 'step'}\n", " return [optimizer], [scheduler]" ] }, { "cell_type": "markdown", "metadata": { "id": "kK725aSEuUvG" }, "source": [ "## Data bookkeeping" ] }, { "cell_type": "markdown", "metadata": { "id": "KdWVtWUcbDhQ" }, "source": [ "Put everything into PyTorch and do a train/test split:" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "id": "0ym19abgbDhR" }, "outputs": [], "source": [ "Xt = torch.tensor(X).float()\n", "zt = torch.tensor(z).float()\n", "X_train, X_test, z_train, z_test = train_test_split(Xt, zt, random_state=0)\n", "train_set = TensorDataset(X_train, z_train)\n", "train = DataLoader(train_set, batch_size=128, num_workers=2)\n", "test_set = TensorDataset(X_test, z_test)\n", "test = DataLoader(test_set, batch_size=256, num_workers=2)" ] }, { "cell_type": "markdown", "metadata": { "id": "3dw_NefuudIq" }, "source": [ "## Train the model with PyTorch Lightning on GPUs:" ] }, { "cell_type": "markdown", "metadata": { "id": "hhlhLQUBbDhT" }, "source": [ "Start the model:" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/" }, "id": "1ldN0999bDhU", "outputId": "269efa86-3331-4ba8-a763-4f26823c8532" }, "outputs": [], "source": [ "pl.seed_everything(0)\n", "model = SumNet()\n", "model.total_steps = total_steps\n", "model.max_lr = 1e-2" ] }, { "cell_type": "markdown", "metadata": { "id": "WWRsu5A9bDhW" }, "source": [ "PyTorch Lightning trainer object:" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/" }, "id": "33R2nrv-b62w", "outputId": "20a8f626-bb4b-4d08-d8ed-e14fa80b2402" }, "outputs": [], "source": [ "trainer = pl.Trainer(max_steps=total_steps, gpus=1, benchmark=True)" ] }, { "cell_type": "markdown", "metadata": { "id": "jh91CukM5CkI" }, "source": [ "Here, we fit the neural network:" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/", "height": 240, "referenced_widgets": [ "8f7ca3dc340c4b6ca7607fd5c94c79d0", "cbf3650d65dc4b8986a569d9700207a2", "b8f6f05741f94fd49901887de806eb1d", "ad40f001306a44ecb54f62ab0adda91f", "e4b89b77f1c94abdbffc3ae9a931a148", "ca1e9af3973845c1b2daee8df04f5050", "d638dbb7e0a846ebaca5487f0f384b75", "441f56016ea143d98f2185b42009cc68", "957b8217f89d449ea226c8d211ca055d", "c5956a7501e649319193f6899e6f94af", "e1cc5d5e17ed43ebbd1420cbfbd06758", "0882e10f5ceb4917b6455b74cf4facfb", "5867c0acc2a84f7196aa782f0dc6d4bc", "a243c86070ac4590a99ec844c2cbf677", "397d0a9190f042d8969d96b766e93b90", "76abe6940c3642ea90bbab0409d47f80", "c3d9456987764530a49f80fd08a8a058", "4419e5228ebb46578d550c3f24096c92", "c1b5d69fe13445179345852b4c5c4a3f", "ff544275991b474981f0e55a01a4739a", "d8747da8b9984e12bf4d7f352a3c0a21", "6c06f7dfa9d84d62b218d8182c164bf9" ] }, "id": "TXZdF8k1bDhY", "outputId": "6b1b7f68-5dd8-4613-95a7-776fc71c841f" }, "outputs": [], "source": [ "trainer.fit(model, train_dataloaders=train, val_dataloaders=test)" ] }, { "cell_type": "markdown", "metadata": { "id": "uYzk0yU4ulfH" }, "source": [ "## Latent vectors of network\n", "\n", "Let's get the input and output of the learned `g` function from the network over some random data:" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/" }, "id": "s2sQLla5bDhb", "outputId": "5b48a0b0-6c5e-4e9a-8bfe-e9f888af4b9d" }, "outputs": [], "source": [ "np.random.seed(0)\n", "idx = np.random.randint(0, 10000, size=1000)\n", "\n", "X_for_pysr = Xt[idx]\n", "y_i_for_pysr = model.g(X_for_pysr)[:, :, 0]\n", "y_for_pysr = torch.sum(y_i_for_pysr, dim=1)/y_i_for_pysr.shape[1]\n", "z_for_pysr = zt[idx] #Use true values.\n", "\n", "X_for_pysr.shape, y_i_for_pysr.shape" ] }, { "cell_type": "markdown", "metadata": { "id": "nCCIvvAGuyFi" }, "source": [ "## Learning over the network:\n", "\n", "Now, let's fit `g` using PySR:" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/" }, "id": "51QdHVSkbDhc", "outputId": "3058d58a-dbbd-4e78-c810-439a5fca76e6", "scrolled": true }, "outputs": [], "source": [ "np.random.seed(1)\n", "tmpX = X_for_pysr.detach().numpy().reshape(-1, 5)\n", "tmpy = y_i_for_pysr.detach().numpy().reshape(-1)\n", "idx2 = np.random.randint(0, tmpy.shape[0], size=3000)\n", "\n", "model = PySRRegressor(\n", " niterations=20,\n", " binary_operators=[\"plus\", \"sub\", \"mult\"],\n", " unary_operators=[\"cos\", \"square\", \"neg\"]\n", ")\n", "model.fit(X=tmpX[idx2], y=tmpy[idx2])" ] }, { "cell_type": "markdown", "metadata": { "id": "xginVMmTu3MZ" }, "source": [ "## Validation" ] }, { "cell_type": "markdown", "metadata": { "id": "6WuaeqyqbDhe" }, "source": [ "Recall we are searching for $y_i$ above:\n", "\n", "$$ z = y^2,\\quad y = \\frac{1}{100} \\sum(y_i),\\quad y_i = x_{i0}^2 + 6 \\cos(2 x_{i2})$$" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "id": "E1_VWQ45bDhf" }, "outputs": [], "source": [ "model" ] }, { "cell_type": "markdown", "metadata": { "id": "mlU1hidZkgCY" }, "source": [ "A neural network can easily undo a linear transform, so this is fine: the network for $f$ will learn to undo the linear transform.\n", "\n", "Then, we can learn another analytic equation for $z$." ] }, { "cell_type": "markdown", "metadata": { "id": "TntGlQEwbDhk" }, "source": [ "**Now, we can compose these together to get the time series model!**\n", "\n", "Think about what we just did: we found an analytical equation for $z$ in terms of $500$ datapoints, under the assumption that $z$ is a function of a sum of another function over an axis:\n", "\n", "$$ z = f(\\sum_i g(x_i)) $$\n", "\n", "And we pulled out analytical copies for $g$ using symbolic regression." ] }, { "cell_type": "markdown", "metadata": { "id": "1QsHVjAVbDhk" }, "source": [ "# Other PySR Options" ] }, { "cell_type": "markdown", "metadata": { "id": "S5dO61g1bDhk" }, "source": [ "The full list of PySR parameters can be found here: https://pysr.readthedocs.io/en/latest/docs/api-documentation/" ] } ], "metadata": { "accelerator": "GPU", "colab": { "collapsed_sections": [], "name": "PySR_Demo_v4.ipynb", "provenance": [] }, "kernelspec": { "display_name": "Python (main_ipynb)", "language": "python", "name": "main_ipynb" }, "language_info": { "codemirror_mode": { "name": "ipython", "version": 3 }, "file_extension": ".py", "mimetype": "text/x-python", "name": "python", "nbconvert_exporter": "python", "pygments_lexer": "ipython3", "version": "3.8.9" }, "widgets": { "application/vnd.jupyter.widget-state+json": { "0882e10f5ceb4917b6455b74cf4facfb": { "model_module": "@jupyter-widgets/controls", "model_module_version": "1.5.0", "model_name": "HBoxModel", "state": { "_dom_classes": [], "_model_module": "@jupyter-widgets/controls", "_model_module_version": "1.5.0", "_model_name": "HBoxModel", "_view_count": null, "_view_module": "@jupyter-widgets/controls", "_view_module_version": "1.5.0", "_view_name": "HBoxView", "box_style": "", "children": [ "IPY_MODEL_5867c0acc2a84f7196aa782f0dc6d4bc", "IPY_MODEL_a243c86070ac4590a99ec844c2cbf677", "IPY_MODEL_397d0a9190f042d8969d96b766e93b90" ], "layout": "IPY_MODEL_76abe6940c3642ea90bbab0409d47f80" } }, "397d0a9190f042d8969d96b766e93b90": { "model_module": "@jupyter-widgets/controls", "model_module_version": "1.5.0", "model_name": "HTMLModel", "state": { "_dom_classes": [], "_model_module": "@jupyter-widgets/controls", "_model_module_version": "1.5.0", "_model_name": "HTMLModel", "_view_count": null, "_view_module": "@jupyter-widgets/controls", "_view_module_version": "1.5.0", "_view_name": "HTMLView", "description": "", "description_tooltip": null, "layout": "IPY_MODEL_d8747da8b9984e12bf4d7f352a3c0a21", "placeholder": "​", "style": "IPY_MODEL_6c06f7dfa9d84d62b218d8182c164bf9", "value": " 2120/2442 [00:18<00:02, 116.15it/s, loss=6.39, v_num=14]" } }, "4419e5228ebb46578d550c3f24096c92": { "model_module": "@jupyter-widgets/controls", "model_module_version": "1.5.0", "model_name": "DescriptionStyleModel", "state": { "_model_module": "@jupyter-widgets/controls", "_model_module_version": "1.5.0", "_model_name": "DescriptionStyleModel", "_view_count": null, "_view_module": "@jupyter-widgets/base", "_view_module_version": "1.2.0", "_view_name": "StyleView", "description_width": "" } }, "441f56016ea143d98f2185b42009cc68": { "model_module": "@jupyter-widgets/base", "model_module_version": "1.2.0", "model_name": "LayoutModel", "state": { "_model_module": "@jupyter-widgets/base", "_model_module_version": "1.2.0", "_model_name": "LayoutModel", "_view_count": null, "_view_module": "@jupyter-widgets/base", "_view_module_version": "1.2.0", "_view_name": "LayoutView", "align_content": null, "align_items": null, "align_self": null, "border": null, "bottom": null, "display": null, "flex": "2", "flex_flow": null, "grid_area": null, "grid_auto_columns": null, "grid_auto_flow": null, "grid_auto_rows": null, "grid_column": null, "grid_gap": null, "grid_row": null, "grid_template_areas": null, "grid_template_columns": null, "grid_template_rows": null, "height": null, "justify_content": null, "justify_items": null, "left": null, "margin": null, "max_height": null, "max_width": null, "min_height": null, "min_width": null, "object_fit": null, "object_position": null, "order": null, "overflow": null, "overflow_x": null, "overflow_y": null, "padding": null, "right": null, "top": null, "visibility": null, "width": null } }, "5867c0acc2a84f7196aa782f0dc6d4bc": { "model_module": "@jupyter-widgets/controls", "model_module_version": "1.5.0", "model_name": "HTMLModel", "state": { "_dom_classes": [], "_model_module": "@jupyter-widgets/controls", "_model_module_version": "1.5.0", "_model_name": "HTMLModel", "_view_count": null, "_view_module": "@jupyter-widgets/controls", "_view_module_version": "1.5.0", "_view_name": "HTMLView", "description": "", "description_tooltip": null, "layout": "IPY_MODEL_c3d9456987764530a49f80fd08a8a058", "placeholder": "​", "style": "IPY_MODEL_4419e5228ebb46578d550c3f24096c92", "value": "Epoch 0: 87%" } }, "6c06f7dfa9d84d62b218d8182c164bf9": { "model_module": "@jupyter-widgets/controls", "model_module_version": "1.5.0", "model_name": "DescriptionStyleModel", "state": { "_model_module": "@jupyter-widgets/controls", "_model_module_version": "1.5.0", "_model_name": "DescriptionStyleModel", "_view_count": null, "_view_module": "@jupyter-widgets/base", "_view_module_version": "1.2.0", "_view_name": "StyleView", "description_width": "" } }, "76abe6940c3642ea90bbab0409d47f80": { "model_module": "@jupyter-widgets/base", "model_module_version": "1.2.0", "model_name": "LayoutModel", "state": { "_model_module": "@jupyter-widgets/base", "_model_module_version": "1.2.0", "_model_name": "LayoutModel", "_view_count": null, "_view_module": "@jupyter-widgets/base", "_view_module_version": "1.2.0", "_view_name": "LayoutView", "align_content": null, "align_items": null, "align_self": null, "border": null, "bottom": null, "display": "inline-flex", "flex": null, "flex_flow": "row wrap", "grid_area": null, "grid_auto_columns": null, "grid_auto_flow": null, "grid_auto_rows": null, "grid_column": null, "grid_gap": null, "grid_row": null, "grid_template_areas": null, "grid_template_columns": null, "grid_template_rows": null, "height": null, "justify_content": null, "justify_items": null, "left": null, "margin": null, "max_height": null, "max_width": null, "min_height": null, "min_width": null, "object_fit": null, "object_position": null, "order": null, "overflow": null, "overflow_x": null, "overflow_y": null, "padding": null, "right": null, "top": null, "visibility": null, "width": "100%" } }, "8f7ca3dc340c4b6ca7607fd5c94c79d0": { "model_module": "@jupyter-widgets/controls", "model_module_version": "1.5.0", "model_name": "HBoxModel", "state": { "_dom_classes": [], "_model_module": "@jupyter-widgets/controls", "_model_module_version": "1.5.0", "_model_name": "HBoxModel", "_view_count": null, "_view_module": "@jupyter-widgets/controls", "_view_module_version": "1.5.0", "_view_name": "HBoxView", "box_style": "", "children": [ "IPY_MODEL_cbf3650d65dc4b8986a569d9700207a2", "IPY_MODEL_b8f6f05741f94fd49901887de806eb1d", "IPY_MODEL_ad40f001306a44ecb54f62ab0adda91f" ], "layout": "IPY_MODEL_e4b89b77f1c94abdbffc3ae9a931a148" } }, "957b8217f89d449ea226c8d211ca055d": { "model_module": "@jupyter-widgets/controls", "model_module_version": "1.5.0", "model_name": "ProgressStyleModel", "state": { "_model_module": "@jupyter-widgets/controls", "_model_module_version": "1.5.0", "_model_name": "ProgressStyleModel", "_view_count": null, "_view_module": "@jupyter-widgets/base", "_view_module_version": "1.2.0", "_view_name": "StyleView", "bar_color": null, "description_width": "" } }, "a243c86070ac4590a99ec844c2cbf677": { "model_module": "@jupyter-widgets/controls", "model_module_version": "1.5.0", "model_name": "FloatProgressModel", "state": { "_dom_classes": [], "_model_module": "@jupyter-widgets/controls", "_model_module_version": "1.5.0", "_model_name": "FloatProgressModel", "_view_count": null, "_view_module": "@jupyter-widgets/controls", "_view_module_version": "1.5.0", "_view_name": "ProgressView", "bar_style": "", "description": "", "description_tooltip": null, "layout": "IPY_MODEL_c1b5d69fe13445179345852b4c5c4a3f", "max": 2442, "min": 0, "orientation": "horizontal", "style": "IPY_MODEL_ff544275991b474981f0e55a01a4739a", "value": 2120 } }, "ad40f001306a44ecb54f62ab0adda91f": { "model_module": "@jupyter-widgets/controls", "model_module_version": "1.5.0", "model_name": "HTMLModel", "state": { "_dom_classes": [], "_model_module": "@jupyter-widgets/controls", "_model_module_version": "1.5.0", "_model_name": "HTMLModel", "_view_count": null, "_view_module": "@jupyter-widgets/controls", "_view_module_version": "1.5.0", "_view_name": "HTMLView", "description": "", "description_tooltip": null, "layout": "IPY_MODEL_c5956a7501e649319193f6899e6f94af", "placeholder": "​", "style": "IPY_MODEL_e1cc5d5e17ed43ebbd1420cbfbd06758", "value": " 2/2 [00:00<00:00, 55.34it/s]" } }, "b8f6f05741f94fd49901887de806eb1d": { "model_module": "@jupyter-widgets/controls", "model_module_version": "1.5.0", "model_name": "FloatProgressModel", "state": { "_dom_classes": [], "_model_module": "@jupyter-widgets/controls", "_model_module_version": "1.5.0", "_model_name": "FloatProgressModel", "_view_count": null, "_view_module": "@jupyter-widgets/controls", "_view_module_version": "1.5.0", "_view_name": "ProgressView", "bar_style": "", "description": "", "description_tooltip": null, "layout": "IPY_MODEL_441f56016ea143d98f2185b42009cc68", "max": 2, "min": 0, "orientation": "horizontal", "style": "IPY_MODEL_957b8217f89d449ea226c8d211ca055d", "value": 2 } }, "c1b5d69fe13445179345852b4c5c4a3f": { "model_module": "@jupyter-widgets/base", "model_module_version": "1.2.0", "model_name": "LayoutModel", "state": { "_model_module": "@jupyter-widgets/base", "_model_module_version": "1.2.0", "_model_name": "LayoutModel", "_view_count": null, "_view_module": "@jupyter-widgets/base", "_view_module_version": "1.2.0", "_view_name": "LayoutView", "align_content": null, "align_items": null, "align_self": null, "border": null, "bottom": null, "display": null, "flex": "2", "flex_flow": null, "grid_area": null, "grid_auto_columns": null, "grid_auto_flow": null, "grid_auto_rows": null, "grid_column": null, "grid_gap": null, "grid_row": null, "grid_template_areas": null, "grid_template_columns": null, "grid_template_rows": null, "height": null, "justify_content": null, "justify_items": null, "left": null, "margin": null, "max_height": null, "max_width": null, "min_height": null, "min_width": null, "object_fit": null, "object_position": null, "order": null, "overflow": null, "overflow_x": null, "overflow_y": null, "padding": null, "right": null, "top": null, "visibility": null, "width": null } }, "c3d9456987764530a49f80fd08a8a058": { "model_module": "@jupyter-widgets/base", "model_module_version": "1.2.0", "model_name": "LayoutModel", "state": { "_model_module": "@jupyter-widgets/base", "_model_module_version": "1.2.0", "_model_name": "LayoutModel", "_view_count": null, "_view_module": "@jupyter-widgets/base", "_view_module_version": "1.2.0", "_view_name": "LayoutView", "align_content": null, "align_items": null, "align_self": null, "border": null, "bottom": null, "display": null, "flex": null, "flex_flow": null, "grid_area": null, "grid_auto_columns": null, "grid_auto_flow": null, "grid_auto_rows": null, "grid_column": null, "grid_gap": null, "grid_row": null, "grid_template_areas": null, "grid_template_columns": null, "grid_template_rows": null, "height": null, "justify_content": null, "justify_items": null, "left": null, "margin": null, "max_height": null, "max_width": null, "min_height": null, "min_width": null, "object_fit": null, "object_position": null, "order": null, "overflow": null, "overflow_x": null, "overflow_y": null, "padding": null, "right": null, "top": null, "visibility": null, "width": null } }, "c5956a7501e649319193f6899e6f94af": { "model_module": "@jupyter-widgets/base", "model_module_version": "1.2.0", "model_name": "LayoutModel", "state": { "_model_module": "@jupyter-widgets/base", "_model_module_version": "1.2.0", "_model_name": "LayoutModel", "_view_count": null, "_view_module": "@jupyter-widgets/base", "_view_module_version": "1.2.0", "_view_name": "LayoutView", "align_content": null, "align_items": null, "align_self": null, "border": null, "bottom": null, "display": null, "flex": null, "flex_flow": null, "grid_area": null, "grid_auto_columns": null, "grid_auto_flow": null, "grid_auto_rows": null, "grid_column": null, "grid_gap": null, "grid_row": null, "grid_template_areas": null, "grid_template_columns": null, "grid_template_rows": null, "height": null, "justify_content": null, "justify_items": null, "left": null, "margin": null, "max_height": null, "max_width": null, "min_height": null, "min_width": null, "object_fit": null, "object_position": null, "order": null, "overflow": null, "overflow_x": null, "overflow_y": null, "padding": null, "right": null, "top": null, "visibility": null, "width": null } }, "ca1e9af3973845c1b2daee8df04f5050": { "model_module": "@jupyter-widgets/base", "model_module_version": "1.2.0", "model_name": "LayoutModel", "state": { "_model_module": "@jupyter-widgets/base", "_model_module_version": "1.2.0", "_model_name": "LayoutModel", "_view_count": null, "_view_module": "@jupyter-widgets/base", "_view_module_version": "1.2.0", "_view_name": "LayoutView", "align_content": null, "align_items": null, "align_self": null, "border": null, "bottom": null, "display": null, "flex": null, "flex_flow": null, "grid_area": null, "grid_auto_columns": null, "grid_auto_flow": null, "grid_auto_rows": null, "grid_column": null, "grid_gap": null, "grid_row": null, "grid_template_areas": null, "grid_template_columns": null, "grid_template_rows": null, "height": null, "justify_content": null, "justify_items": null, "left": null, "margin": null, "max_height": null, "max_width": null, "min_height": null, "min_width": null, "object_fit": null, "object_position": null, "order": null, "overflow": null, "overflow_x": null, "overflow_y": null, "padding": null, "right": null, "top": null, "visibility": null, "width": null } }, "cbf3650d65dc4b8986a569d9700207a2": { "model_module": "@jupyter-widgets/controls", "model_module_version": "1.5.0", "model_name": "HTMLModel", "state": { "_dom_classes": [], "_model_module": "@jupyter-widgets/controls", "_model_module_version": "1.5.0", "_model_name": "HTMLModel", "_view_count": null, "_view_module": "@jupyter-widgets/controls", "_view_module_version": "1.5.0", "_view_name": "HTMLView", "description": "", "description_tooltip": null, "layout": "IPY_MODEL_ca1e9af3973845c1b2daee8df04f5050", "placeholder": "​", "style": "IPY_MODEL_d638dbb7e0a846ebaca5487f0f384b75", "value": "Sanity Checking DataLoader 0: 100%" } }, "d638dbb7e0a846ebaca5487f0f384b75": { "model_module": "@jupyter-widgets/controls", "model_module_version": "1.5.0", "model_name": "DescriptionStyleModel", "state": { "_model_module": "@jupyter-widgets/controls", "_model_module_version": "1.5.0", "_model_name": "DescriptionStyleModel", "_view_count": null, "_view_module": "@jupyter-widgets/base", "_view_module_version": "1.2.0", "_view_name": "StyleView", "description_width": "" } }, "d8747da8b9984e12bf4d7f352a3c0a21": { "model_module": "@jupyter-widgets/base", "model_module_version": "1.2.0", "model_name": "LayoutModel", "state": { "_model_module": "@jupyter-widgets/base", "_model_module_version": "1.2.0", "_model_name": "LayoutModel", "_view_count": null, "_view_module": "@jupyter-widgets/base", "_view_module_version": "1.2.0", "_view_name": "LayoutView", "align_content": null, "align_items": null, "align_self": null, "border": null, "bottom": null, "display": null, "flex": null, "flex_flow": null, "grid_area": null, "grid_auto_columns": null, "grid_auto_flow": null, "grid_auto_rows": null, "grid_column": null, "grid_gap": null, "grid_row": null, "grid_template_areas": null, "grid_template_columns": null, "grid_template_rows": null, "height": null, "justify_content": null, "justify_items": null, "left": null, "margin": null, "max_height": null, "max_width": null, "min_height": null, "min_width": null, "object_fit": null, "object_position": null, "order": null, "overflow": null, "overflow_x": null, "overflow_y": null, "padding": null, "right": null, "top": null, "visibility": null, "width": null } }, "e1cc5d5e17ed43ebbd1420cbfbd06758": { "model_module": "@jupyter-widgets/controls", "model_module_version": "1.5.0", "model_name": "DescriptionStyleModel", "state": { "_model_module": "@jupyter-widgets/controls", "_model_module_version": "1.5.0", "_model_name": "DescriptionStyleModel", "_view_count": null, "_view_module": "@jupyter-widgets/base", "_view_module_version": "1.2.0", "_view_name": "StyleView", "description_width": "" } }, "e4b89b77f1c94abdbffc3ae9a931a148": { "model_module": "@jupyter-widgets/base", "model_module_version": "1.2.0", "model_name": "LayoutModel", "state": { "_model_module": "@jupyter-widgets/base", "_model_module_version": "1.2.0", "_model_name": "LayoutModel", "_view_count": null, "_view_module": "@jupyter-widgets/base", "_view_module_version": "1.2.0", "_view_name": "LayoutView", "align_content": null, "align_items": null, "align_self": null, "border": null, "bottom": null, "display": "inline-flex", "flex": null, "flex_flow": "row wrap", "grid_area": null, "grid_auto_columns": null, "grid_auto_flow": null, "grid_auto_rows": null, "grid_column": null, "grid_gap": null, "grid_row": null, "grid_template_areas": null, "grid_template_columns": null, "grid_template_rows": null, "height": null, "justify_content": null, "justify_items": null, "left": null, "margin": null, "max_height": null, "max_width": null, "min_height": null, "min_width": null, "object_fit": null, "object_position": null, "order": null, "overflow": null, "overflow_x": null, "overflow_y": null, "padding": null, "right": null, "top": null, "visibility": null, "width": "100%" } }, "ff544275991b474981f0e55a01a4739a": { "model_module": "@jupyter-widgets/controls", "model_module_version": "1.5.0", "model_name": "ProgressStyleModel", "state": { "_model_module": "@jupyter-widgets/controls", "_model_module_version": "1.5.0", "_model_name": "ProgressStyleModel", "_view_count": null, "_view_module": "@jupyter-widgets/base", "_view_module_version": "1.2.0", "_view_name": "StyleView", "bar_color": null, "description_width": "" } } } } }, "nbformat": 4, "nbformat_minor": 0 }