ayushnoori commited on
Commit
3a2cd91
Β·
1 Parent(s): 4e9445d

Add demonstration

Browse files
Files changed (2) hide show
  1. README.md +1 -1
  2. app.py +92 -7
README.md CHANGED
@@ -92,7 +92,7 @@ Synthesis Results:
92
 
93
  To add additional input-output examples, modify `examples.py`. Add a new key to the dictionary `example_set` and set the value to be a list of tuples.
94
 
95
- ## πŸ”Ž Algorithmic Description
96
 
97
  The most important data structure in this implementation is the abstract syntax tree (AST). The AST is a tree representation of a program, where each node is either a primitive or a compound expression. The AST is represented by the `OperatorNode` class in `abstract_syntax_tree.py`. My AST implementation includes functions to recursively evaluate the operator and its operands and also to generate a string representation of the program.
98
 
 
92
 
93
  To add additional input-output examples, modify `examples.py`. Add a new key to the dictionary `example_set` and set the value to be a list of tuples.
94
 
95
+ ## πŸ”Ž Algorithm Details
96
 
97
  The most important data structure in this implementation is the abstract syntax tree (AST). The AST is a tree representation of a program, where each node is either a primitive or a compound expression. The AST is represented by the `OperatorNode` class in `abstract_syntax_tree.py`. My AST implementation includes functions to recursively evaluate the operator and its operands and also to generate a string representation of the program.
98
 
app.py CHANGED
@@ -17,11 +17,6 @@ import config
17
  # write streamlit title
18
  st.title("Bottom-Up Program Synthesis")
19
 
20
- # create class to hold arguments instead of arg-parser
21
- class Args(object):
22
- pass
23
- args = Args()
24
-
25
  st.markdown('''
26
  Completed for [CS252R: Program Synthesis](https://synthesis.metareflection.club/) at the Harvard John A. Paulson School of Engineering and Applied Sciences, taught in Fall 2023 by Prof. Nada Amin.
27
  ''')
@@ -32,7 +27,97 @@ st.markdown('''
32
  Here, we implement the non-ML subset of BUSTLE, the algorithm proposed by [Odena *et al.* (2021)](https://arxiv.org/abs/2007.14381). That is, we implement bottom-up enumerative search for simple compound expressions, excluding conditionals, recursion, and loops. The implementation is generic and flexibly supports multiple target languages. Arithmetic and string manipulations are natively supported, defined in `arithmetic.py` and `string.py`, respectively.
33
  ''')
34
 
35
- st.header("πŸ”Ž Algorithmic Description")
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
36
 
37
  st.markdown('''
38
  The most important data structure in this implementation is the abstract syntax tree (AST). The AST is a tree representation of a program, where each node is either a primitive or a compound expression. The AST is represented by the `OperatorNode` class in `abstract_syntax_tree.py`. My AST implementation includes functions to recursively evaluate the operator and its operands and also to generate a string representation of the program.
@@ -40,7 +125,7 @@ The most important data structure in this implementation is the abstract syntax
40
  At program evaluation time, the AST is evaluated from the bottom up. That is, the operands are evaluated first, and then the operator is evaluated on the operands. This is implemented in the `evaluate` method of the `OperatorNode` class. In the case of integers, variable inputs are represented by the `IntegerVariable` class in `arithmetic.py`. When input is not `None`, input type checking and validation is performed by the `evaluate` function in this class.
41
 
42
  The pseudocode for the bottom-up synthesis algorithm is reproduced below from [Odena *et al.* (2021)](https://arxiv.org/abs/2007.14381):
43
- '''
44
 
45
  st.image("https://github.com/ayushnoori/program-synthesis/assets/43010710/117e7797-11af-4b72-b5f4-dda95eb2260f")
46
 
 
17
  # write streamlit title
18
  st.title("Bottom-Up Program Synthesis")
19
 
 
 
 
 
 
20
  st.markdown('''
21
  Completed for [CS252R: Program Synthesis](https://synthesis.metareflection.club/) at the Harvard John A. Paulson School of Engineering and Applied Sciences, taught in Fall 2023 by Prof. Nada Amin.
22
  ''')
 
27
  Here, we implement the non-ML subset of BUSTLE, the algorithm proposed by [Odena *et al.* (2021)](https://arxiv.org/abs/2007.14381). That is, we implement bottom-up enumerative search for simple compound expressions, excluding conditionals, recursion, and loops. The implementation is generic and flexibly supports multiple target languages. Arithmetic and string manipulations are natively supported, defined in `arithmetic.py` and `string.py`, respectively.
28
  ''')
29
 
30
+ st.subheader("Input-Output Examples")
31
+
32
+ st.markdown('''
33
+ Select input-output examples as defined in `examples.py`, or define your own custom examples. The examples are used to synthesize a program that satisfies the examples.
34
+ ''')
35
+
36
+ # define variables
37
+ domain = "arithmetic"
38
+ examples_key = "addition"
39
+ max_weight = 3
40
+
41
+ # retrieve selected input-output examples
42
+ examples = example_set[examples_key]
43
+
44
+ # extract constants from examples
45
+ program_bank = extract_constants(examples)
46
+ program_bank_str = [p.str() for p in program_bank]
47
+ print("\nSynthesis Log:")
48
+ print(f"- Extracted {len(program_bank)} constants from examples.")
49
+
50
+ # define operators
51
+ if domain == "arithmetic":
52
+ operators = arithmetic_operators
53
+ elif domain == "strings":
54
+ operators = string_operators
55
+ else:
56
+ raise Exception('Domain not recognized. Must be either "arithmetic" or "string".')
57
+
58
+ # define final program
59
+ final_program = None
60
+
61
+ start_time = time.time()
62
+ # iterate over each level
63
+ for weight in range(2, max_weight):
64
+
65
+ # print message
66
+ print(f"- Searching level {weight} with {len(program_bank)} primitives.")
67
+
68
+ # iterate over each operator
69
+ for op in operators:
70
+
71
+ # get all possible combinations of primitives in program bank
72
+ combinations = itertools.combinations(program_bank, op.arity)
73
+
74
+ # iterate over each combination
75
+ for combination in combinations:
76
+
77
+ # get type signature
78
+ type_signature = [p.type for p in combination]
79
+
80
+ # check if type signature matches operator
81
+ if type_signature != op.arg_types:
82
+ continue
83
+
84
+ # check that sum of weights of arguments <= w
85
+ if sum([p.weight for p in combination]) > weight:
86
+ continue
87
+
88
+ # create new program
89
+ program = OperatorNode(op, combination)
90
+
91
+ # check if program is in program bank using string representation
92
+ if program.str() in program_bank_str:
93
+ continue
94
+
95
+ # check if program is observationally equivalent to any program in program bank
96
+ if any([observationally_equivalent(program, p, examples) for p in program_bank]):
97
+ continue
98
+
99
+ # add program to program bank
100
+ program_bank.append(program)
101
+ program_bank_str.append(program.str())
102
+
103
+ # check if program passes all examples
104
+ if check_program(program, examples):
105
+ final_program = program
106
+
107
+ end_time = time.time()
108
+ elapsed_time = round(end_time - start_time, 4)
109
+
110
+ # check if program was found
111
+ print("\nSynthesis Results:")
112
+ if final_program is None:
113
+ print(f"- Max weight of {max_weight} reached, no program found in {elapsed_time}s.")
114
+ else:
115
+ print(f"- Program found in {elapsed_time}s.")
116
+ print(f"- Program: {final_program.str()}")
117
+ print(f"- Program weight: {final_program.weight}")
118
+ print(f"- Program return type: {final_program.type.__name__}")
119
+
120
+ st.header("πŸ”Ž Algorithm Details")
121
 
122
  st.markdown('''
123
  The most important data structure in this implementation is the abstract syntax tree (AST). The AST is a tree representation of a program, where each node is either a primitive or a compound expression. The AST is represented by the `OperatorNode` class in `abstract_syntax_tree.py`. My AST implementation includes functions to recursively evaluate the operator and its operands and also to generate a string representation of the program.
 
125
  At program evaluation time, the AST is evaluated from the bottom up. That is, the operands are evaluated first, and then the operator is evaluated on the operands. This is implemented in the `evaluate` method of the `OperatorNode` class. In the case of integers, variable inputs are represented by the `IntegerVariable` class in `arithmetic.py`. When input is not `None`, input type checking and validation is performed by the `evaluate` function in this class.
126
 
127
  The pseudocode for the bottom-up synthesis algorithm is reproduced below from [Odena *et al.* (2021)](https://arxiv.org/abs/2007.14381):
128
+ ''')
129
 
130
  st.image("https://github.com/ayushnoori/program-synthesis/assets/43010710/117e7797-11af-4b72-b5f4-dda95eb2260f")
131