jjz5463 committed on
Commit
28e07a2
·
1 Parent(s): df8b1f1

pipeline in Baseline-Experimental.ipynb

Experiments/Baseline-Experimental.ipynb CHANGED
@@ -8,7 +8,45 @@
8
  "collapsed": true
9
  },
10
  "outputs": [],
11
- "source": []
12
  }
13
  ],
14
  "metadata": {
 
8
  "collapsed": true
9
  },
10
  "outputs": [],
11
+ "source": [
12
+ "# Utilize the Google Cloud Vision API to recognize text in the input images (diary images), https://cloud.google.com/vision.\n"
13
+ ]
14
+ },
15
+ {
16
+ "cell_type": "code",
17
+ "execution_count": null,
18
+ "outputs": [],
19
+ "source": [
20
+ "# Use the PaLM 2 Bison for Text model to perform NLP tasks on the diary text, such as summarizing and condensing it, https://ai.google.dev/palm_docs/palm.\n"
21
+ ],
22
+ "metadata": {
23
+ "collapsed": false
24
+ },
25
+ "id": "204930c13fd1e579"
26
+ },
27
+ {
28
+ "cell_type": "code",
29
+ "execution_count": null,
30
+ "outputs": [],
31
+ "source": [
32
+ "# Use Gemini 1.0 Pro Vision to take an image of the diary writer as input and output a textual description of that image, https://ai.google.dev/gemini-api/docs/models/gemini.\n"
33
+ ],
34
+ "metadata": {
35
+ "collapsed": false
36
+ },
37
+ "id": "7f0c7d788b8de177"
38
+ },
39
+ {
40
+ "cell_type": "code",
41
+ "execution_count": null,
42
+ "outputs": [],
43
+ "source": [
44
+ "# Now that you have text from the diary and text describing the diary writer, you can use the SDXL-Turbo Stable Diffusion model to generate images, https://huggingface.co/stabilityai/sdxl-turbo. Try generating several images for a diary entry. Analyze how accurate the results are, and think about what could be improved.\n"
45
+ ],
46
+ "metadata": {
47
+ "collapsed": false
48
+ },
49
+ "id": "c475ca58dea760da"
50
  }
51
  ],
52
  "metadata": {