Talk: Numerical Methods of Learning from Precedents (practice, V.V. Strijov)
From MachineLearning.
The main page will be gradually translated into English owing to a significant number of requests. 9.2.2014
Introduction
"Machine Learning and Data Analysis" is a practical course that focuses on methods for scientific research. The course teaches students how to conduct research projects in machine learning and data analysis. The general goal is to learn to convey ideas in a precise, clear and elegant way; the specific goal is to write a research paper that is accepted by other researchers in the field and to present a report on it. The expected result is a research paper submitted to a peer-reviewed journal from the list compiled by the Higher Attestation Commission.
The course introduces students to the technologies used in scientific research and teaches them to present the results of their studies in the format used by other researchers in machine learning and data analysis. By the end of the term, each student is expected to write a research paper and submit it to a peer-reviewed journal from the list compiled by the Higher Attestation Commission. During the course, students learn the basics of scientific writing and of designing computational experiments, using associated tools such as the typesetting system LaTeX, the bibliographic system BibTeX, and the computing environment MATLAB.
The work on a project includes exploring the literature, writing a mathematical problem statement and an algorithm description, investigating the algorithm's properties, and running computational experiments. Each student selects a personal problem from the list of suggested research topics. The student analyzes recent publications on the selected topic, formulates the problem and presents it to the group. Then the student gives a mathematical description and analysis of the suggested methods, followed by an intermediate report. The last step is to run computational experiments that illustrate the method's properties on real or synthetic data. Each paper undergoes a revision process with the student's peers acting as reviewers. The work is synchronized via SourceForge.org, in the "MLAlgorithms" project.
Course format. Each project is aided by an assistant and an expert. The student is willing to learn how to formally state research problems, find adequate references, and generate novel and significant ideas for solving them.
An assistant helps the student with technical issues, consults the student on topics of machine learning, reacts promptly to arising problems, and performs evaluation and grading. Each assistant is expected to have sufficient publishing experience. Ideally, the assistant is writing a paper on an adjacent topic. It is recommended to organize the weekly review process so that the student enters the corrections personally.
An expert guarantees the novelty and importance of the paper, suggests problems, and provides data.
Course-related materials
- Brief description of the course: goals, structure and grading policy CourseShort.pdf
- Bibliography of all completed projects (191 projects as of December 2014)
- Slides in PDF with course overview (goals, syllabus, summary of 2009-2014 results) CourseSlides.pdf
- Basic schedule with the list of tasks to complete
- Report on the course results, Fall 2013 Report2013Fall.pdf
- Report presentation templates in pdf, tex
- Lists of recommended journals on Machine Learning and Data Analysis:
  - High impact factor High_IF_ScientificJournals.pdf
  - Low impact factor Low_IF_ScientificJournals.pdf
- On reviewing/resubmitting/correcting the paper:
  - Examples of feedback from reviewers: Review1.pdf, Review2.pdf, Review3.pdf
  - Sample responses Response1.pdf, Response2.pdf
  - Correction sample CorrectedPaper.pdf
Past terms
Link to the course page | Description |
---|---|
Group 274, summer 2015 (In Russian) | My first publication in a Higher Attestation Commission journal. The course involves experts and personal assistants. |
Group YАД, summer 2015 (In Russian) | My first publication in a Higher Attestation Commission journal. The course involves experts and personal assistants. |
Group 174, summer 2015 (In Russian) | Research planning. |
Group 174, winter 2014 (In Russian) | Conducting commercially oriented research, developing applications. The problems are chosen from industrial and academic sources. |
Group 974, winter 2014 (In Russian) | Lectures on emerging machine learning issues. Essays and practice in Mathematica. |
Group 174, summer 2014 (In Russian) | My first publication in a Higher Attestation Commission journal. The course involves experts and personal assistants. |
Group 074, summer 2014 (In Russian) | Writing essays: brief problem statements and analysis. |
Group 974, summer 2014 (In Russian) | The "Software engineering" course, professor L. Karpov |
Requirements
Basic
- Students must have previously passed courses in analysis, discrete mathematics, probability theory, statistical inference, and optimization algorithms.
Advanced
- Students are encouraged to get acquainted with the materials of the lecture course on machine learning by K. Vorontsov.
Approximate syllabus
- Find and describe the data. Compose a reference list and store it in a bib file. Write an abstract for the paper.
- Visualize the data. Write a literature review.
- Write an introduction to the paper. The introduction should include a review of existing methods and a description of the proposed approach.
- Write a problem statement. Emphasize the novelty of the suggested approach. Draft a solution.
- Design a computational experiment and obtain initial results.
- Describe the suggested approach in detail.
- Complete computational experiments.
- Describe the results of computational experiments. This includes error analysis and comparison to other methods.
- Correct the paper according to the reviewers' comments.
- Correct the theoretical content.
- Correct the paper's structure.
- Submit the manuscript of the paper to a journal.
- Present a report.
Consulting and grading
- The project is divided into separate tasks, each followed by a list of requirements that determine the quality criteria for grading.
- Each task must be completed within the week and submitted by the day preceding the lecture.
- Preferably, each task is improved and resubmitted several times before the deadline.
Each completed task (marked with a corresponding letter) yields 1 point, and the suffix +/- adds/subtracts 0.25 points.
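The scoring rule above can be sketched in Python. This is a minimal illustration, not the course's grading tool: the mark format (a task letter optionally followed by `+` or `-`) and the function names `task_points` and `total_points` are assumptions made for the example.

```python
from typing import Iterable

BASE_POINTS = 1.0   # each completed task yields 1 point
SUFFIX_STEP = 0.25  # "+" adds and "-" subtracts 0.25 points


def task_points(mark: str) -> float:
    """Return the points for one completed task.

    Assumed (hypothetical) mark format: a single task letter,
    optionally followed by "+" or "-", e.g. "A", "B+", "C-".
    """
    mark = mark.strip()
    if not mark or not mark[0].isalpha():
        raise ValueError(f"unrecognized mark: {mark!r}")
    suffix = mark[1:]
    if suffix == "":
        return BASE_POINTS
    if suffix == "+":
        return BASE_POINTS + SUFFIX_STEP
    if suffix == "-":
        return BASE_POINTS - SUFFIX_STEP
    raise ValueError(f"unrecognized suffix in mark: {mark!r}")


def total_points(marks: Iterable[str]) -> float:
    """Sum the points over all completed tasks."""
    return sum(task_points(m) for m in marks)


if __name__ == "__main__":
    # Three completed tasks: one plain, one with "+", one with "-".
    print(total_points(["A", "B+", "C-"]))
```

A task that was not completed simply does not appear in the list of marks, so it contributes nothing to the total.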
Homework
Note for assistants. The tasks listed below provide quality criteria for homework grading.
Homework1: synchronization tools
- Acquire the technical computing environment (MATLAB or Octave).
- Install the typesetting system TeX (MikTeX for Windows, TeX Live for Linux and Mac OS).
- Install a text editor, for example TeXnicCenter or WinEdt for Windows, and TeXworks for Linux.
- Install the bibliographic reference manager JabRef.
- Create an account at the [1] repository and e-mail the login to the group's coordinator. Read the [introductory materials] on version control systems.
- Install a Subversion client (TortoiseSVN for Windows, RabbitVCS for Linux).
- Following the guidelines, check out the MLAlgorithms repository.
- Create an account at MachineLearning.ru and e-mail the login to the group's coordinator.
Run the installed tools and get acquainted with their interfaces.
Homework1: LaTeX
- If necessary, read the LaTeX and BibTeX articles.
- Download the article template (ZIP) and compile it.
Homework1: MATLAB
- Read the [introductory materials] on MATLAB.
- Read the documentation conventions in Matlab Programming Style Guidelines.