Regularization network Transformer
Regularization-based ONNX implementation for environment temperature.
- Input
  - 7513-dim embedding
- Encoder
  - 6 x Transformer with 40 heads
- Output
  - f1 projection
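
The layer list above can be captured as a small spec object for sanity checks. This is a minimal sketch, not the project's actual code: the class name `EncoderSpec` and its methods are hypothetical, and it assumes the 7513-dim embedding is also the encoder width. Standard multi-head attention splits the width evenly across heads, so the sketch reports whether the listed numbers allow that and, if not, the next width that would.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class EncoderSpec:
    """Hypothetical container for the numbers listed in the summary."""
    embed_dim: int = 7513   # from the summary; assumed to be the encoder width
    num_layers: int = 6     # from the summary
    num_heads: int = 40     # from the summary

    def head_dim(self) -> Optional[int]:
        """Per-head width if embed_dim splits evenly across heads, else None."""
        quotient, remainder = divmod(self.embed_dim, self.num_heads)
        return quotient if remainder == 0 else None

    def padded_dim(self) -> int:
        """Smallest width >= embed_dim that is divisible by num_heads."""
        return -(-self.embed_dim // self.num_heads) * self.num_heads


spec = EncoderSpec()
print(spec.head_dim())    # None: 7513 is not a multiple of 40
print(spec.padded_dim())  # 7520
```

If the real model does use an even per-head split, a projection or padding step between the embedding and the encoder would be one way to reconcile the two numbers; the summary does not say which applies.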
Training config
optimizer=RMSprop, lr=0.270, scheduler=exponential, warmup=71
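
The schedule named in the config can be sketched as a plain function of the step index. Several details here are assumptions, since the config line does not state them: warmup is taken to be linear and counted in optimizer steps, the exponential decay is taken to start once warmup ends, and the decay factor `gamma=0.99` is a placeholder value. The function name `lr_at` is hypothetical.

```python
def lr_at(step: int, base_lr: float = 0.270, warmup: int = 71,
          gamma: float = 0.99) -> float:
    """Learning rate at a zero-based step index.

    base_lr and warmup come from the config line.
    gamma is NOT given in the config; 0.99 is an illustrative placeholder.
    """
    if step < warmup:
        # Assumed linear warmup: ramps up and reaches base_lr on the last warmup step.
        return base_lr * (step + 1) / warmup
    # Assumed exponential decay, counted from the end of warmup.
    return base_lr * gamma ** (step - warmup)


for step in (0, 35, 70, 71, 100, 500):
    print(step, round(lr_at(step), 6))
```

Writing the schedule as a pure function makes it easy to plot or to compare against whatever scheduler the training framework provides before committing to a long run.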