mac999 commited on
Commit
66510b9
·
verified ·
1 Parent(s): 7fa1148

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +67 -3
README.md CHANGED
@@ -1,3 +1,67 @@
1
- ---
2
- license: mit
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # Earthwork Network Architecture (ENA)
2
+
3
+ ## Overview
4
+
5
+ The Earthwork Network Architecture (ENA) is the deep learning model designed to compare the accurate estimation prediction of earthwork quantities. This repository includes four distinct deep learning models—MLP, LSTM, Transformers, and LLM-based architectures (BERT)—tailored for automating and enhancing earthwork quantity estimation from CAD-based cross-sectional drawings.
6
+
7
+ ### Key Features:
8
+ 1. **Multi-Model Approach**:
9
+ - **MLP-Based Model**: Lightweight and efficient for smaller datasets.
10
+ - **LSTM-Based Model**: Ideal for sequential dependencies in time-series data.
11
+ - **Transformer-Based Model**: Handles complex relationships and large datasets.
12
+ - **LLM-Based Model (BERT)**: Excels in processing contextual and unstructured data.
13
+
14
+ 2. **Automated Data Processing**:
15
+ - Converts CAD cross-sectional drawings into structured datasets.
16
+ - Employs a Half-Edge Topology Structure to tokenize and preprocess geometrical features.
17
+
18
+ 3. **Enhanced Performance**:
19
+ - Provides superior accuracy in Quantity Takeoff Classification (QTC) with reduced loss metrics.
20
+ - Demonstrates robust generalization for unseen datasets, validated through a real-world road construction project.
21
+
22
+ ### Research Basis:
23
+ The ENA is detailed in the paper *Earthwork Network Architecture (ENA): Research for Earthwork Quantity Estimation Method Improvement with Large Language Model*. It showcases a comparative analysis of the ENA models and demonstrates the advantages of LLM-based approaches in construction engineering.
24
+
25
+ ## Usage
26
+
27
+ ### Prerequisites
28
+ - **Programming Language**: Python 3.8 or above.
29
+ - **Libraries**: Install the required libraries using `pip install`. Detailed dependencies will be provided in the code files.
30
+
31
+ ### Data Preparation
32
+ 1. **Input Data**:
33
+ - Prepare CAD cross-sectional drawings as input files.
34
+ - Use the provided scripts to preprocess and tokenize geometrical features.
35
+
36
+ 2. **Training Data**:
37
+ - Features are tokenized into sequences for MLP, LSTM, Transformers, and LLM models.
38
+
39
+ ### Training and Evaluation
40
+ 1. Select the model architecture (`MLP`, `LSTM`, `Transformer`, or `LLM`).
41
+ 2. Configure hyperparameters (batch size, learning rate, etc.) as required.
42
+ 3. Run the training script:
43
+ ```bash
44
+ python train_ena_model.py --model_type [MLP|LSTM|Transformer|LLM]
45
+ ```
46
+ 4. Evaluate the model using the test dataset:
47
+ ```bash
48
+ python evaluate_ena_model.py --model_path [path/to/trained/model]
49
+ ```
50
+
51
+ ## Results
52
+ - **Best Model**: LLM-based ENA achieved a QTC accuracy of **97.17%**, outperforming other architectures in accuracy and stability.
53
+ - **Performance Trade-Offs**: LLMs provide high accuracy but require significant computational resources compared to other models.
54
+
55
+ ## Coming Soon
56
+ - Source codes for ENA models.
57
+ - Step-by-step tutorials for dataset preparation and model training.
58
+
59
+ ## License
60
+ This project is licensed under the MIT License.
61
+
62
+ ## Citation
63
+ If you use this repository, please cite:
64
+ ```
65
+ Kang, T.; Kang, K. Earthwork Network Architecture (ENA): Research for Earthwork Quantity Estimation Method Improvement with Large Language Models. Appl. Sci. 2024, 14, 10517.
66
+ https://doi.org/10.3390/app142210517
67
+ ```