# Neural nets - Number of neurons per layer - Number of layers - [[Optimizers]] - [[SGD]] + [[momentum]] - [[Adam]] / [[Adadelta]] / [[Adagrad]] / ... - In practice lead to more overfitting - [[Batch size]] - [[Learning rate]] - [[Regularization]] - [[L2]]/[[L1]] for weights - [[Dropout]]/[[Dropconnect]] - [[Static dropconnect]]