---
license: mit
datasets:
- m7alek/ninth_file
- m7alek/external_df
- m7alek/eighth_file
language:
- ar
- en
metrics:
- accuracy
- bertscore
base_model:
- nvidia/Llama-3.1-Nemotron-70B-Instruct-HF
new_version: google/gemma-7b-aps-it
library_name: transformers
tags:
- text-generation-inference
---
This directory includes a few sample datasets to get you started.

*   `california_housing_data*.csv` is California housing data from the 1990 US
    Census; more information is available at:
    https://docs.google.com/document/d/e/2PACX-1vRhYtsvc5eOR2FWNCwaBiKL6suIOrxJig8LcSBbmCbyYsayia_DvPOOBlXZ4CAlQ5nlDD8kTaIDRwrN/pub

*   `mnist_*.csv` is a small sample of the
    [MNIST database](https://en.wikipedia.org/wiki/MNIST_database), which is
    described at: http://yann.lecun.com/exdb/mnist/

*   `anscombe.json` contains a copy of
    [Anscombe's quartet](https://en.wikipedia.org/wiki/Anscombe%27s_quartet); it
    was originally described in

    Anscombe, F. J. (1973). 'Graphs in Statistical Analysis'. American
    Statistician. 27 (1): 17-21. JSTOR 2682899.

    and our copy was prepared by the
    [vega_datasets library](https://github.com/altair-viz/vega_datasets/blob/4f67bdaad10f45e3549984e17e1b3088c731503d/vega_datasets/_data/anscombe.json).
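
As a quick sanity check after downloading, a minimal sketch along these lines could load `anscombe.json` and print the per-series means. It assumes the file is a JSON list of row objects with `Series`, `X`, and `Y` keys (the layout used by the vega_datasets copy, but worth verifying against your file). The helper names `load_records` and `summarize_by_series` are hypothetical and only for illustration.

```python
import json
from collections import defaultdict
from statistics import mean


def load_records(path):
    """Read a JSON file that holds a list of row objects."""
    with open(path, encoding="utf-8") as f:
        return json.load(f)


def summarize_by_series(records):
    """Return {series: (mean_x, mean_y)}.

    Assumes every row has "Series", "X", and "Y" keys.
    """
    groups = defaultdict(list)
    for row in records:
        groups[row["Series"]].append((row["X"], row["Y"]))
    return {
        name: (mean(x for x, _ in points), mean(y for _, y in points))
        for name, points in groups.items()
    }


if __name__ == "__main__":
    # The relative path is an assumption; adjust it to where the file lives.
    for name, (mx, my) in summarize_by_series(load_records("anscombe.json")).items():
        print(f"series {name}: mean X = {mx:.2f}, mean Y = {my:.2f}")
```

The four series in Anscombe's quartet are known for having nearly identical summary statistics while looking very different when plotted, so similar means across series are expected rather than a sign of a loading error.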