# Neural nets

- Number of neurons per layer
- Number of layers
- [[Optimizers]]
  - [[SGD]] + [[momentum]]
  - [[Adam]] / [[Adadelta]] / [[Adagrad]] / ...
    - In practice, these adaptive methods often overfit more (generalize worse) than SGD + momentum

- [[Batch size]]
- [[Learning rate]]
- [[Regularization]]
  - [[L2]]/[[L1]] for weights
  - [[Dropout]]/[[Dropconnect]]
  - [[Static dropconnect]]
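
A minimal sketch of how a few of the knobs above enter a training step, in plain Python with no framework. The function names (`sgd_momentum_step`, `dropout`, `dropconnect`) and the default values are illustrative assumptions, not taken from any library. The dropout shown is the common "inverted" variant, and the DropConnect mask is a simplification that skips the inference-time treatment.

```python
import random


def sgd_momentum_step(w, grad, velocity, lr=0.1, momentum=0.9, l2=0.0):
    """One SGD + momentum update with an optional L2 penalty on the weights.

    All names and defaults here are hypothetical, chosen for illustration.
    """
    new_w, new_v = [], []
    for wi, gi, vi in zip(w, grad, velocity):
        gi = gi + l2 * wi              # gradient of (l2 / 2) * wi**2 is l2 * wi
        vi = momentum * vi - lr * gi   # velocity accumulates past gradients
        new_v.append(vi)
        new_w.append(wi + vi)
    return new_w, new_v


def dropout(activations, p, rng):
    """Inverted dropout: zero each activation with probability p (0 <= p < 1)
    and rescale the survivors by 1 / (1 - p)."""
    if not 0.0 <= p < 1.0:
        raise ValueError("p must be in [0, 1)")
    keep = 1.0 - p
    return [a / keep if rng.random() < keep else 0.0 for a in activations]


def dropconnect(weights, p, rng):
    """DropConnect mask: zero individual weights (not activations) with
    probability p. No rescaling is applied in this simplified sketch."""
    if not 0.0 <= p < 1.0:
        raise ValueError("p must be in [0, 1)")
    keep = 1.0 - p
    return [[w if rng.random() < keep else 0.0 for w in row] for row in weights]


def minimize_quadratic(steps=200, lr=0.1, momentum=0.9):
    """Toy run: minimize f(w) = 0.5 * w**2, whose gradient is simply w."""
    w, v = [1.0], [0.0]
    for _ in range(steps):
        w, v = sgd_momentum_step(w, grad=list(w), velocity=v,
                                 lr=lr, momentum=momentum)
    return w[0]


if __name__ == "__main__":
    rng = random.Random(0)
    print("w after training:", minimize_quadratic())
    print("dropout:", dropout([1.0, 1.0, 1.0, 1.0], p=0.5, rng=rng))
    print("dropconnect:", dropconnect([[1.0, 2.0], [3.0, 4.0]], p=0.5, rng=rng))
```

The difference between the two masks is what gets zeroed: dropout removes whole unit outputs, while DropConnect removes single connections in the weight matrix. The learning rate, momentum, L2 strength, and drop probability `p` are the values to tune.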