https://arxiv.org/abs/2310.20360 Skip to main content Cornell University Served from the cloud We gratefully acknowledge support from the Simons Foundation, member institutions, and all contributors. Donate arxiv logo > cs > arXiv:2310.20360 [ ] Help | Advanced Search [All fields ] Search arXiv logo Cornell University Logo [ ] GO quick links * Login * Help Pages * About Computer Science > Machine Learning arXiv:2310.20360 (cs) [Submitted on 31 Oct 2023] Title:Mathematical Introduction to Deep Learning: Methods, Implementations, and Theory Authors:Arnulf Jentzen, Benno Kuckuck, Philippe von Wurstemberger Download a PDF of the paper titled Mathematical Introduction to Deep Learning: Methods, Implementations, and Theory, by Arnulf Jentzen and 2 other authors Download PDF Abstract:This book aims to provide an introduction to the topic of deep learning algorithms. We review essential components of deep learning algorithms in full mathematical detail including different artificial neural network (ANN) architectures (such as fully-connected feedforward ANNs, convolutional ANNs, recurrent ANNs, residual ANNs, and ANNs with batch normalization) and different optimization algorithms (such as the basic stochastic gradient descent (SGD) method, accelerated methods, and adaptive methods). We also cover several theoretical aspects of deep learning algorithms such as approximation capacities of ANNs (including a calculus for ANNs), optimization theory (including Kurdyka-Lojasiewicz inequalities), and generalization errors. In the last part of the book some deep learning approximation methods for PDEs are reviewed including physics-informed neural networks (PINNs) and deep Galerkin methods. We hope that this book will be useful for students and scientists who do not yet have any background in deep learning at all and would like to gain a solid foundation as well as for practitioners who would like to obtain a firmer mathematical understanding of the objects and methods considered in deep learning. Comments: 601 pages, 36 figures, 45 source codes Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Subjects: Numerical Analysis (math.NA); Probability (math.PR); Machine Learning (stat.ML) MSC 68T07 classes: Cite as: arXiv:2310.20360 [cs.LG] (or arXiv:2310.20360v1 [cs.LG] for this version) https://doi.org/10.48550/arXiv.2310.20360 Focus to learn more arXiv-issued DOI via DataCite Submission history From: Benno Kuckuck [view email] [v1] Tue, 31 Oct 2023 11:01:23 UTC (2,327 KB) Full-text links: Access Paper: Download a PDF of the paper titled Mathematical Introduction to Deep Learning: Methods, Implementations, and Theory, by Arnulf Jentzen and 2 other authors * Download PDF * PostScript * Other Formats (view license) Current browse context: cs.LG < prev | next > new | recent | 2310 Change to browse by: cs cs.AI cs.NA math math.NA math.PR stat stat.ML References & Citations * NASA ADS * Google Scholar * Semantic Scholar a export BibTeX citation Loading... BibTeX formatted citation x [loading... ] Data provided by: Bookmark BibSonomy logo Reddit logo (*) Bibliographic Tools Bibliographic and Citation Tools [ ] Bibliographic Explorer Toggle Bibliographic Explorer (What is the Explorer?) [ ] Litmaps Toggle Litmaps (What is Litmaps?) [ ] scite.ai Toggle scite Smart Citations (What are Smart Citations?) ( ) Code, Data, Media Code, Data and Media Associated with this Article [ ] Links to Code Toggle CatalyzeX Code Finder for Papers (What is CatalyzeX?) [ ] DagsHub Toggle DagsHub (What is DagsHub?) [ ] Links to Code Toggle Papers with Code (What is Papers with Code?) [ ] ScienceCast Toggle ScienceCast (What is ScienceCast?) ( ) Demos Demos [ ] Replicate Toggle Replicate (What is Replicate?) [ ] Spaces Toggle Hugging Face Spaces (What is Spaces?) [ ] Spaces Toggle TXYZ.AI (What is TXYZ.AI?) ( ) Related Papers Recommenders and Search Tools [ ] Link to Influence Flower Influence Flower (What are Influence Flowers?) [ ] Connected Papers Toggle Connected Papers (What is Connected Papers?) [ ] Core recommender toggle CORE Recommender (What is CORE?) [ ] IArxiv recommender toggle IArxiv Recommender (What is IArxiv?) * Author * Venue * Institution * Topic ( ) About arXivLabs arXivLabs: experimental projects with community collaborators arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website. Both individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them. Have an idea for a project that will add value for arXiv's community? Learn more about arXivLabs. Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?) * About * Help * Click here to contact arXiv Contact * Click here to subscribe Subscribe * Copyright * Privacy Policy * Web Accessibility Assistance * arXiv Operational Status Get status notifications via email or slack