# Running: You can execute the program from the command line with ```bash julia --threads 6 -O3 -e 'include("paralleleureqa.jl"); fullRun(20, npop=100, annealing=true, ncyclesperiteration=5000, fractionReplaced=0.1f0, verbosity=round(Int32, 1e9), topn=10)' ``` Modify the hyperparameters in `hyperparams.jl` and the dataset in `dataset.jl` (see below for options). The first argument to `fullRun` is the number of migration periods to run, with `ncyclesperiteration` determining how many generations per migration period. `npop` is the number of population members. `annealing` determines whether to stay in exploration mode, or tune it down with each cycle. `fractionReplaced` is how much of the population is replaced by migrated equations each step. `topn` is the number of top members of each population to migrate. Run it with threading turned on using: `julia --threads auto -O3 myfile.jl` ## Modification You can change the binary and unary operators in `hyperparams.jl` here: ```julia const binops = [plus, mult] const unaops = [sin, cos, exp]; ``` E.g., you can add the function for powers with: ```julia pow(x::Float32, y::Float32)::Float32 = sign(x)*abs(x)^y const binops = [plus, mult, pow] ``` You can change the dataset here: ```julia const X = convert(Array{Float32, 2}, randn(100, 5)*2) # Here is the function we want to learn (x2^2 + cos(x3) - 5) const y = convert(Array{Float32, 1}, ((cx,)->cx^2).(X[:, 2]) + cos.(X[:, 3]) .- 5) ``` by either loading in a dataset, or modifying the definition of `y`. (The `.` are are used for vectorization of a scalar function) ### Hyperparameters Annealing allows each evolutionary cycle to turn down the exploration rate over time: at the end (temperature 0), it will only select solutions better than existing solutions. The following parameter, parsimony, is how much to punish complex solutions: ```julia const parsimony = 0.01 ``` Finally, the following determins how much to scale temperature by (T between 0 and 1). ```julia const alpha = 10.0 ``` Larger alpha means more exploration. One can also adjust the relative probabilities of each operation here: ```julia weights = [8, 1, 1, 1, 0.1, 0.5, 2] ``` for: 1. Perturb constant 2. Mutate operator 3. Append a node 4. Delete a subtree 5. Simplify equation 6. Randomize completely 7. Do nothing # TODO - [ ] Hyperparameter tune - [ ] Add mutation for constant<->variable - [ ] Create a Python interface - [ ] Create a benchmark for accuracy - [ ] Create struct to pass through all hyperparameters, instead of treating as constants - Make sure doesn't affect performance - [ ] Use NN to generate weights over all probability distribution conditional on error and existing equation, and train on some randomly-generated equations - [ ] Performance: - [ ] Use an enum for functions instead of storing them? - Current most expensive operations: - [x] deepcopy() before the mutate, to see whether to accept or not. - Seems like its necessary right now. But still by far the slowest option. - [ ] Calculating the loss function - there is duplicate calculations happening. - [ ] Declaration of the weights array every iteration - [x] Explicit constant optimization on hall-of-fame - Create method to find and return all constants, from left to right - Create method to find and set all constants, in same order - Pull up some optimization algorithm and add it. Keep the package small! - [x] Create a benchmark for speed - [x] Simplify subtrees with only constants beneath them. Or should I? Maybe randomly simplify sometimes? - [x] Record hall of fame - [x] Optionally (with hyperparameter) migrate the hall of fame, rather than current bests - [x] Test performance of reduced precision integers - No effect