https://arxiv.org/abs/2310.20360

Skip to main content
Cornell University
Served from the cloud
We gratefully acknowledge support from the Simons Foundation, member
institutions, and all contributors. Donate
 
arxiv logo > cs > arXiv:2310.20360
[                    ]

Help | Advanced Search

[All fields        ]
Search
arXiv logo
Cornell University Logo
[                    ] GO
quick links

  * Login
  * Help Pages
  * About

Computer Science > Machine Learning

arXiv:2310.20360 (cs)
[Submitted on 31 Oct 2023]

Title:Mathematical Introduction to Deep Learning: Methods,
Implementations, and Theory

Authors:Arnulf Jentzen, Benno Kuckuck, Philippe von Wurstemberger
Download a PDF of the paper titled Mathematical Introduction to Deep
Learning: Methods, Implementations, and Theory, by Arnulf Jentzen and
2 other authors
Download PDF

    Abstract:This book aims to provide an introduction to the topic
    of deep learning algorithms. We review essential components of
    deep learning algorithms in full mathematical detail including
    different artificial neural network (ANN) architectures (such as
    fully-connected feedforward ANNs, convolutional ANNs, recurrent
    ANNs, residual ANNs, and ANNs with batch normalization) and
    different optimization algorithms (such as the basic stochastic
    gradient descent (SGD) method, accelerated methods, and adaptive
    methods). We also cover several theoretical aspects of deep
    learning algorithms such as approximation capacities of ANNs
    (including a calculus for ANNs), optimization theory (including
    Kurdyka-Lojasiewicz inequalities), and generalization errors. In
    the last part of the book some deep learning approximation
    methods for PDEs are reviewed including physics-informed neural
    networks (PINNs) and deep Galerkin methods. We hope that this
    book will be useful for students and scientists who do not yet
    have any background in deep learning at all and would like to
    gain a solid foundation as well as for practitioners who would
    like to obtain a firmer mathematical understanding of the objects
    and methods considered in deep learning.

Comments: 601 pages, 36 figures, 45 source codes
          Machine Learning (cs.LG); Artificial Intelligence (cs.AI);
Subjects: Numerical Analysis (math.NA); Probability (math.PR);
          Machine Learning (stat.ML)
MSC       68T07
classes:
Cite as:  arXiv:2310.20360 [cs.LG]
          (or arXiv:2310.20360v1 [cs.LG] for this version)
          https://doi.org/10.48550/arXiv.2310.20360
          Focus to learn more
          arXiv-issued DOI via DataCite

Submission history

From: Benno Kuckuck [view email]
[v1] Tue, 31 Oct 2023 11:01:23 UTC (2,327 KB)
Full-text links:

Access Paper:

    Download a PDF of the paper titled Mathematical Introduction to
    Deep Learning: Methods, Implementations, and Theory, by Arnulf
    Jentzen and 2 other authors
  * Download PDF
  * PostScript
  * Other Formats

(view license)
Current browse context:
cs.LG
< prev   |   next >
new | recent | 2310
Change to browse by:
cs
cs.AI
cs.NA
math
math.NA
math.PR
stat
stat.ML

References & Citations

  * NASA ADS
  * Google Scholar
  * Semantic Scholar

a export BibTeX citation Loading...

BibTeX formatted citation

x
[loading...          ]
Data provided by:

Bookmark

BibSonomy logo Reddit logo
(*) Bibliographic Tools

Bibliographic and Citation Tools

[ ] Bibliographic Explorer Toggle
Bibliographic Explorer (What is the Explorer?)
[ ] Litmaps Toggle
Litmaps (What is Litmaps?)
[ ] scite.ai Toggle
scite Smart Citations (What are Smart Citations?)
( ) Code, Data, Media

Code, Data and Media Associated with this Article

[ ] Links to Code Toggle
CatalyzeX Code Finder for Papers (What is CatalyzeX?)
[ ] DagsHub Toggle
DagsHub (What is DagsHub?)
[ ] Links to Code Toggle
Papers with Code (What is Papers with Code?)
[ ] ScienceCast Toggle
ScienceCast (What is ScienceCast?)
( ) Demos

Demos

[ ] Replicate Toggle
Replicate (What is Replicate?)
[ ] Spaces Toggle
Hugging Face Spaces (What is Spaces?)
[ ] Spaces Toggle
TXYZ.AI (What is TXYZ.AI?)
( ) Related Papers

Recommenders and Search Tools

[ ] Link to Influence Flower
Influence Flower (What are Influence Flowers?)
[ ] Connected Papers Toggle
Connected Papers (What is Connected Papers?)
[ ] Core recommender toggle
CORE Recommender (What is CORE?)
[ ] IArxiv recommender toggle
IArxiv Recommender (What is IArxiv?)

  * Author
  * Venue
  * Institution
  * Topic

( ) About arXivLabs

arXivLabs: experimental projects with community collaborators

arXivLabs is a framework that allows collaborators to develop and
share new arXiv features directly on our website.

Both individuals and organizations that work with arXivLabs have
embraced and accepted our values of openness, community, excellence,
and user data privacy. arXiv is committed to these values and only
works with partners that adhere to them.

Have an idea for a project that will add value for arXiv's community?
Learn more about arXivLabs.

Which authors of this paper are endorsers? | Disable MathJax (What is
MathJax?)

  * About
  * Help

  * Click here to contact arXiv Contact
  * Click here to subscribe Subscribe

  * Copyright
  * Privacy Policy

  * Web Accessibility Assistance
  * arXiv Operational Status
    Get status notifications via email or slack