Teaching Data Science

265

135

Yogesh Haribhau Kulkarni

Added on September 14, 2024

Repository for LaTeX course notes for Python, Machine Learning, Deep Learning, Natural Language Processing, etc. Core content is in the form for Beamer slides, which in turn can get compiled into presentation mode as well as two-column course notes mode.

Teaching Data Science

Repository for LaTeX course notes for Python, Machine Learning, Deep Learning, Natural Language Processing, etc. Core content is in the form for Beamer slides, which in turn can get compiled into presentation mode as well as two-column course notes mode.

Yogesh Haribhau Kulkarni

README.md

Repository for LaTeX course notes for Python, Machine Learning, Deep Learning, Natural Language Processing, etc. Core content is in the form for Beamer slides, which in turn can get compiled into presentation pdf as well as two-column course notes pdf.

All tex sources and images have been open sourced as I have built them by referring to material from others, learnt from others, although I have added some of mine, I need to give it back!!

LinkedIn post: https://www.linkedin.com/feed/update/urn:li:activity:6523000857385103360

  • Mission: To spread knowledge of Data Science related subjects to a wider audience.
  • Vision: Let many participate in the industry which is aiming for "From Automation, To Autonomy!!"
  • Values: Giving back, paying it forward!!
  • Goal: "From Automation to Autonomy"!!

Copyright (C) 2019 Yogesh H Kulkarni

Steps for Usage

Code Arrangement

  • LaTeX directory * Has tex sources along with necessary images * Naming: subject_maintopic_subtopic.tex eg maths_linearalgebra_matrices.tex * Main_[|Seminar|Workshop|Course]_[Presentation/CheatSheet].tex are the driver files * Seminar is of about 1 hour, Workshop is about 1 day and Course is of 1 week/semester * They in-turn contain common content files, which include the actual source files * Make specific bat files compile the appropriate sources
  • Code directory * Has running python or ipython notebook files with necessary images and data * Naming should be such that corresponding latex file can be associated * Library based tex file, say, sklearn_decisiontree.tex will have just template code and short fully working examples from Mastering Machine Learning whereas the sklearn_decisiontree.ipynb will have running workflows. No need to sync both. You may want to keep ipynb’s pdf in LaTeX/images directory for reference
  • References directory (not uploaded, as it is mostly from others github repos, nothing much of mine) * Has papers, code, ppts as base material to be used for content preparation

Requirements

  • LaTeX (tested with MikTex 2.9 on Windows 7, 64bit)
  • Need to install LaTeX packages, as and when you get such warning/suggestions.
  • Using TexWorks as IDE

How to Run LaTeX:

  • Driver files for the courses are named with "Main_Workshop/Seminar__CheatSheet/Presentation.tex"
  • Both the Cheatsheet (meaning course notes in two column format) and Presentation.tex refer to same core content file, which in turn contains are the topic files.
  • Run make bat for the course you need. Inside, its just a texify command, so you can modify it as per your OS.
  • You can compile individual "Main_Workshop/Seminar__CheatSheet/Presentation.tex" also using your LaTeX system.
  • Instead of these given driver files, you can have your own main files and include just the *content.tex files.

How to Contribute

  • Go to 'LaTeX' folder.
  • Copy your images into 'images' folder and source code to 'src' folder
  • Sample files are kept to copy and modify, they are Main_Sample_Presentation.tex, Main_Sample_CheatSheet.tex, both of them call sample_content.tex.
  • Fill your material directly in the content file or have further organization into multiple files and then input them in the content file.

Disclaimer:

  • Author (firstnamelastname at yahoo dot com) gives no guarantee of the correctness of the content. Notes have been built using lots of publicly available material.
  • Although care has been taken to cite the original sources as much as possible, but there could be some missing ones. Do point them and I will update wherever possible.
  • Lots of improvements are still to be made. So, don’t depend on it at all, fully.
  • Do send in your suggestions/comments/corrections/Pull-requests.
Related Content

YoloV7 in Tensorflow.js

This repository is an implementation of Yolov7 using Tensorflow.js. The code runs directly on the browser and the detector was trained on the MS COCO dataset to recognizes up to 80 different classes.
GitHubUpdated 17 months ago

machine_learning_complete

This is a comprehensive repository containing 30+ notebooks on Python programming, data manipulation, data analysis, data visualization, data cleaning, classical machine learning, Computer Vision and Natural Language Processing(NLP).
GitHubUpdated 18 months ago

Semantic Sementation model within ML pipeline

This repository shows how to build a Machine Learning Pipeline for Semantic Segmentation task with TensorFlow Extended(TFX) and various GCP products such as Vertex Pipeline, Training, and Enpoint.
GitHubUpdated 21 months ago

Care4ther-

This model will help the users to get an idea of the accidents well before the accident took place. For the people safety, by the people safety
GitHubUpdated 26 months ago

MLOps for Vision Models from 🤗 with TFX

This repository shows how to build Machine Learning pipeline for a vision model (TensorFlow) from 🤗 Transformers using the TensorFlow Ecosystem.
GitHubUpdated 29 months ago

We serve cookies on this site to analyze traffic, remember your preferences, and optimize your experience.