decision-tree classification GRU
Decision-tree-based TensorFlow implementation for fine-tuning hyperparameters.
- Input
  - 2488-dim embedding
- Encoder
  - 124 x GRU with 22 heads
- Output
  - accuracy projection
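
The layer outline above can be sketched in Keras roughly as follows. This is a minimal sketch under stated assumptions, not the actual implementation: the function name `build_gru_classifier`, the hidden size `units`, and the number of output classes `num_classes` are hypothetical, since the outline does not give them. The "22 heads" setting has no direct counterpart in the standard Keras GRU layer and is left out, and the output projection is assumed to be a dense layer producing class logits.

```python
import tensorflow as tf


def build_gru_classifier(input_dim=2488, num_layers=124, units=64, num_classes=2):
    """Stack of GRU layers over a sequence of embeddings, then a dense projection.

    `units` and `num_classes` are placeholder values (assumptions);
    `input_dim` and `num_layers` follow the outline.
    """
    # Variable-length sequence of input_dim-dimensional embedding vectors.
    inputs = tf.keras.Input(shape=(None, input_dim))
    x = inputs
    for i in range(num_layers):
        is_last = i == num_layers - 1
        # Every layer except the last passes the full sequence to the next GRU;
        # the last layer returns only its final state.
        x = tf.keras.layers.GRU(units, return_sequences=not is_last)(x)
    # Assumed output projection: one logit per class.
    logits = tf.keras.layers.Dense(num_classes)(x)
    return tf.keras.Model(inputs, logits)
```

Calling `build_gru_classifier()` with the defaults builds the full 124-layer stack, which is slow to construct; passing a smaller `num_layers` is a quick way to check shapes first.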
Training config
optimizer=RMSprop, lr=0.363, scheduler=plateau, warmup=1325
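
To make the config line concrete, here is a small plain-Python sketch of how it might be parsed and how warmup and a plateau scheduler could interact. The helper names (`parse_config`, `warmup_lr`, `PlateauScheduler`) are hypothetical. The linear warmup shape, the reduction `factor`, and the `patience` value are assumptions as well, because the config only names the scheduler type and the number of warmup steps.

```python
def parse_config(line):
    """Parse 'key=value, key=value' into a dict, converting numeric values."""
    config = {}
    for item in line.split(","):
        key, value = item.strip().split("=", 1)
        for cast in (int, float):
            try:
                config[key] = cast(value)
                break
            except ValueError:
                continue
        else:
            # Neither int nor float: keep the raw string (e.g. "RMSprop").
            config[key] = value
    return config


def warmup_lr(step, base_lr, warmup_steps):
    """Assumed linear warmup: ramp up to base_lr over warmup_steps, then hold."""
    if warmup_steps <= 0 or step >= warmup_steps:
        return base_lr
    return base_lr * (step + 1) / warmup_steps


class PlateauScheduler:
    """Multiply lr by `factor` after `patience` evaluations without improvement."""

    def __init__(self, lr, factor=0.5, patience=3):  # factor/patience are assumed
        self.lr = lr
        self.factor = factor
        self.patience = patience
        self.best = float("inf")
        self.bad_evals = 0

    def step(self, val_loss):
        if val_loss < self.best:
            self.best = val_loss
            self.bad_evals = 0
        else:
            self.bad_evals += 1
            if self.bad_evals >= self.patience:
                self.lr *= self.factor
                self.bad_evals = 0
        return self.lr


cfg = parse_config("optimizer=RMSprop, lr=0.363, scheduler=plateau, warmup=1325")
```

In a TensorFlow training loop, the value from `warmup_lr` would be applied for the first `cfg["warmup"]` steps, after which the plateau scheduler's `lr` would take over. Keras also ships a `ReduceLROnPlateau` callback that covers the second part.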