Weka Text Classification for First Time & Beginner Users
59-minute beginner-friendly tutorial on text classification in WEKA. After sections 1-2 all text has been converted to numbers and categories, so sections 3-5 apply to many other kinds of data analysis in WEKA, not only text classification.
5 main sections:
0:00 Introduction (5 minutes)
5:06 TextDirectoryLoader (3 minutes)
8:12 StringToWordVector (19 minutes)
27:37 AttributeSelect (10 minutes)
37:37 Cost Sensitivity and Class Imbalance (8 minutes)
45:45 Classifiers (14 minutes)
59:07 Conclusion (20 seconds)
Some notable sub-sections:
- Section 1 -
5:49 TextDirectoryLoader Command (1 minute)
- Section 2 -
6:44 ARFF File Syntax (1 minute 30 seconds)
8:10 Vectorizing Documents (2 minutes)
10:15 WordsToKeep setting/Word Presence (1 minute 10 seconds)
11:26 OutputWordCount setting/Word Frequency (25 seconds)
11:51 DoNotOperateOnAPerClassBasis setting (40 seconds)
12:34 IDFTransform and TFTransform settings/TF-IDF score (1 minute 30 seconds)
14:09 NormalizeDocLength setting (1 minute 17 seconds)
15:46 Stemmer setting/Lemmatization (1 minute 10 seconds)
16:56 Stopwords setting/Custom Stopwords File (1 minute 54 seconds)
18:50 Tokenizer setting/NGram Tokenizer/Bigrams/Trigrams/Alphabetical Tokenizer (2 minutes 35 seconds)
21:25 MinTermFreq setting (20 seconds)
21:45 PeriodicPruning setting (40 seconds)
22:25 AttributeNamePrefix setting (16 seconds)
22:42 LowerCaseTokens setting (1 minute 2 seconds)
23:45 AttributeIndices setting (2 minutes 4 seconds)
- Section 3 -
28:07 AttributeSelect for reducing dataset to improve classifier performance/InfoGainEval evaluator/Ranker search (7 minutes)
- Section 4 -
38:32 CostSensitiveClassifier/Adding cost effectiveness to base classifier (2 minutes 20 seconds)
42:17 Resample filter/Example of undersampling majority class (1 minute 10 seconds)
43:27 SMOTE filter/Example of oversampling the minority class (1 minute)
- Section 5 -
45:34 Training vs. Testing Datasets (1 minute 32 seconds)
47:07 Naive Bayes Classifier (1 minute 57 seconds)
49:04 Multinomial Naive Bayes Classifier (10 seconds)
49:33 K Nearest Neighbor Classifier (1 minute 34 seconds)
51:17 J48 (Decision Tree) Classifier (2 minutes 32 seconds)
53:50 Random Forest Classifier (1 minute 39 seconds)
55:55 SMO (Support Vector Machine) Classifier (1 minute 38 seconds)
57:35 Supervised vs Semi-Supervised vs Unsupervised Learning/Clustering (1 minute 20 seconds)
The Classifiers section introduces six (but not all) of WEKA's popular classifiers for text mining: 1) Naive Bayes, 2) Multinomial Naive Bayes, 3) K Nearest Neighbor, 4) J48, 5) Random Forest and 6) SMO. Each StringToWordVector setting is shown, e.g. tokenizer, outputWordCounts, normalizeDocLength, TF-IDF, stopwords, stemmer, etc. These are ways of representing documents as document vectors. Automatically converting 2,000 text files (plain text documents) into an ARFF file with TextDirectoryLoader is shown. Also shown is AttributeSelect, which is a way of improving classifier performance by reducing the dataset. Cost-Sensitive Classifier is shown, which is a way of assigning weights to different types of guesses. Resample and SMOTE are shown as ways of undersampling the majority class and oversampling the minority class. Introductory tips are shared throughout, e.g. distinguishing supervised learning (which is most of data mining) from semi-supervised and unsupervised learning, making identically-formatted training and testing datasets, how to easily subset outliers with the Visualize tab and more...
----------
Update March 24, 2014: Some people asked where to download the movie review data. It is named Polarity_Dataset_v2.0 and shared on Bo Pang's Cornell Ph.D. student page http://www.cs.cornell.edu/People/pabo/movie-review-data/ (Bo Pang is now a Senior Research Scientist at Google)
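The word-count and TF-IDF document vectors described above can be illustrated outside WEKA. What follows is only a minimal Python sketch, not WEKA's StringToWordVector: the names `tokenize` and `tfidf_vectors` are made up for illustration, and the weighting used (raw count times log(N / document frequency)) is one common TF-IDF variant that may differ from WEKA's exact formula.

```python
import math
import re
from collections import Counter


def tokenize(text):
    # Lowercase and keep alphabetic runs only (roughly: lowercase tokens
    # combined with an alphabetic tokenizer).
    return re.findall(r"[a-z]+", text.lower())


def tfidf_vectors(documents):
    # Hypothetical helper: returns (vocabulary, one TF-IDF vector per document).
    token_lists = [tokenize(doc) for doc in documents]
    vocabulary = sorted({tok for toks in token_lists for tok in toks})
    n_docs = len(documents)
    # Document frequency: in how many documents each word occurs.
    doc_freq = Counter(tok for toks in token_lists for tok in set(toks))
    vectors = []
    for toks in token_lists:
        counts = Counter(toks)
        vectors.append(
            [counts[w] * math.log(n_docs / doc_freq[w]) for w in vocabulary]
        )
    return vocabulary, vectors


docs = ["A great, great movie", "A terrible movie"]
vocab, vecs = tfidf_vectors(docs)
```

Words that occur in every document ("a", "movie") get a weight of zero, while words that separate the documents ("great", "terrible") keep a positive weight, which is the intuition behind using TF-IDF rather than plain counts.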
Text Classification - Natural Language Processing With Python and NLTK p.11
Now that we understand some of the basics of natural language processing with the Python NLTK module, we're ready to try out text classification. This is where we attempt to identify a body of text with a label. To start, we're going to use a binary label. Examples of this could be identifying text as spam or not, or, like what we'll be doing, positive sentiment or negative sentiment. Playlist link: https://www.youtube.com/watch?v=FLZvOKSCkxY&list=PLQVvvaa0QuDf2JswnfiGkliBInZnIC4HL&index=1 sample code: http://pythonprogramming.net http://hkinsley.com https://twitter.com/sentdex http://sentdex.com http://seaofbtc.com
Intro to Feature Engineering with TensorFlow - Machine Learning Recipes #9
Hey everyone! Here’s an intro to techniques you can use to represent your features - including Bucketing, Crossing, Hashing, and Embedding - and utilities TensorFlow provides to help. Also included is a walkthrough of using TensorFlow Estimators to classify structured data. Links from the video: Code - https://goo.gl/K9dVqv Facets: https://goo.gl/Dfpb7W TensorFlow Embedding Projector: https://goo.gl/2SxrYK You can find Josh on Twitter: https://twitter.com/random_forests See Josh as a guest speaker in Week 2 of the openSAP course: https://goo.gl/UGGcX7 Thanks, and have fun! Check out more Machine Learning Recipes here: https://goo.gl/KewA03 Subscribe to the Google Developers channel: http://goo.gl/mQyv5L
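Three of the feature representations named above (bucketing, crossing and hashing) can be sketched in plain Python. This is an illustration of the ideas only and does not use TensorFlow's own utilities; the function names, the age boundaries and the "bachelors" category are assumptions chosen for the example.

```python
import bisect
import zlib


def bucketize(value, boundaries):
    # Bucketing: index of the range that `value` falls into (boundaries sorted).
    return bisect.bisect_right(boundaries, value)


def hash_bucket(category, num_buckets):
    # Hashing: map any string to one of a fixed number of buckets,
    # using a hash that is stable across runs.
    return zlib.crc32(category.encode("utf-8")) % num_buckets


def cross(*feature_values):
    # Crossing: combine several categorical values into one feature.
    return "_x_".join(str(v) for v in feature_values)


age_bucket = bucketize(37, [18, 25, 35, 50, 65])  # 37 lies between 35 and 50
crossed = cross(age_bucket, "bachelors")
crossed_id = hash_bucket(crossed, 100)
```

Hashing the crossed value keeps the number of distinct feature ids bounded even when the number of possible combinations is large, at the cost of occasional collisions.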
Random Forest - Fun and Easy Machine Learning
►FREE YOLO GIFT - http://augmentedstartups.info/yolofreegiftsp ►KERAS COURSE - https://www.udemy.com/machine-learning-fun-and-easy-using-python-and-keras/?couponCode=YOUTUBE_ML ►MACHINE LEARNING COURSES - http://augmentedstartups.info/machine-learning-courses ------------------------------------------------------------------------ Hey guys, and welcome to another Fun and Easy Machine Learning video, this time on Random Forests. Random forest is one of the most popular and most powerful supervised machine learning algorithms, and it can perform both regression and classification tasks. As the name suggests, the algorithm builds a forest from a number of decision trees. In general, the more trees in the forest, the more robust the prediction; likewise, in a random forest classifier a larger number of trees generally gives more accurate results. To build the multiple decision trees that make up the forest, you do not use exactly the same method as for constructing a single decision tree with the information gain or Gini index approach, amongst other algorithms. If you are not familiar with the decision tree classifier, please check out my lecture on Decision Tree CART for Machine Learning. You will need to know how the decision tree classifier works before you can learn how the random forest algorithm works.
------------------------------------------------------------ Support us on Patreon ►AugmentedStartups.info/Patreon Chat to us on Discord ►AugmentedStartups.info/discord Interact with us on Facebook ►AugmentedStartups.info/Facebook Check my latest work on Instagram ►AugmentedStartups.info/instagram Learn Advanced Tutorials on Udemy ►AugmentedStartups.info/udemy ------------------------------------------------------------ To learn more on Artificial Intelligence, Augmented Reality IoT, Deep Learning FPGAs, Arduinos, PCB Design and Image Processing then check out http://augmentedstartups.info/home Please Like and Subscribe for more videos :)
Hello World - Machine Learning Recipes #1
Six lines of Python is all it takes to write your first machine learning program! In this episode, we'll briefly introduce what machine learning is and why it's important. Then, we'll follow a recipe for supervised learning (a technique to create a classifier from examples) and code it up. Follow https://twitter.com/random_forests for updates on new episodes! Subscribe to the Google Developers: http://goo.gl/mQyv5L - Subscribe to the brand new Firebase Channel: https://goo.gl/9giPHG And here's our playlist: https://goo.gl/KewA03
WEKA API 2/19: Loading and Saving Data
To access the code, go to the Machine Learning Tutorials section on the Tutorials page here: http://www.brunel.ac.uk/~csstnns Using WEKA in Java
SeLeCT: Self-Learning Classifier for Internet Traffic
Including Packages ===================== * Complete Source Code * Complete Documentation * Complete Presentation Slides * Flow Diagram * Database File * Screenshots * Execution Procedure * Readme File * Addons * Video Tutorials * Supporting Softwares Specialization ======================= * 24/7 Support * Ticketing System * Voice Conference * Video On Demand * * Remote Connectivity * * Code Customization ** * Document Customization ** * Live Chat Support * Toll Free Support * Call Us:+91 967-774-8277, +91 967-775-1577, +91 958-553-3547 Shop Now @ http://clickmyproject.com Get Discount @ https://goo.gl/lGybbe Chat Now @ http://goo.gl/snglrO Visit Our Channel: http://www.youtube.com/clickmyproject Mail Us: [email protected]
Ankus (Data Mining and Machine Learning) Open Source User Guide Video
A video guide to all the features of Ankus (Data Mining and Machine Learning), a Hadoop-based open-source project.
Maxence Bernard CEO of IO Square on Text Analytics and Machine Learning at iDate
Maxence Bernard, CEO of IO Square, will speak at the 42nd international iDate Online & Mobile Dating Industry Conference & Summit. The event takes place October 14-16, 2015 at the Strand Palace Hotel in London, United Kingdom. Mr. Bernard will cover Text Analytics and Machine Learning and how they apply to the dating industry. iDate is a business-to-business conference covering the global dating industry. The European conference, particularly the one in Great Britain, gives CEOs insight into the UK and European dating marketplace. iDate is attended by C-level executives from the online dating, mobile dating, matchmaking, social discovery and social media segments. Each gains knowledge and insight, and ultimately earns more revenue, from the sessions and the networking the event provides. To see other videos about the event, click here: http://www.youtube.com/watch?v=BBiT1dPJjh4 http://www.youtube.com/watch?v=I9CB_HQM5Zs For more information see: http://www.internetdatingconference.com
K-Fold Cross Validation - Intro to Machine Learning
This video is part of an online course, Intro to Machine Learning. Check out the course here: https://www.udacity.com/course/ud120. This course was designed as part of a program to help you and others become a Data Analyst. You can check out the full details of the program here: https://www.udacity.com/course/nd002.
Decision Tree 1: how it works
Full lecture: http://bit.ly/D-Tree A Decision Tree recursively splits training data into subsets based on the value of a single attribute. Each split corresponds to a node in the tree. Splitting stops when every subset is pure (all elements belong to a single class). This can always be achieved unless there are duplicate training examples with different classes.
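The recursive splitting described above can be sketched in a few lines of Python. This is a simplified illustration under stated assumptions: `build_tree` and `classify` are hypothetical names, the toy weather data is invented, and the attribute to split on is simply taken in the given order rather than chosen by a quality measure as a real decision tree learner would do.

```python
from collections import Counter


def build_tree(rows, labels, attributes):
    # Stop when the subset is pure or no attributes remain; return a leaf label
    # (the majority class covers duplicate examples with different classes).
    if len(set(labels)) == 1 or not attributes:
        return Counter(labels).most_common(1)[0][0]
    # Simplification: split on the next attribute in order, not the best one.
    attribute, remaining = attributes[0], attributes[1:]
    groups = {}
    for row, label in zip(rows, labels):
        sub_rows, sub_labels = groups.setdefault(row[attribute], ([], []))
        sub_rows.append(row)
        sub_labels.append(label)
    # Each split becomes a node with one child per attribute value.
    return {
        attribute: {
            value: build_tree(sub_rows, sub_labels, remaining)
            for value, (sub_rows, sub_labels) in groups.items()
        }
    }


def classify(tree, row):
    # Walk from the root to a leaf, following the row's attribute values.
    while isinstance(tree, dict):
        attribute = next(iter(tree))
        tree = tree[attribute][row[attribute]]
    return tree


rows = [
    {"outlook": "sunny", "windy": "no"},
    {"outlook": "sunny", "windy": "yes"},
    {"outlook": "rain", "windy": "no"},
    {"outlook": "rain", "windy": "yes"},
]
labels = ["play", "play", "play", "stay"]
tree = build_tree(rows, labels, ["outlook", "windy"])
```

In this toy data the "sunny" subset is already pure after the first split, so only the "rain" branch is split again on "windy".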
How to Apply Naive Bayes in Spam Detection - Part I
Access +100 programming courses in Zenva: https://academy.zenva.com/?zva_src=youtube In this course we’ll use Python to create an Artificial Intelligence (AI) that can determine when an incoming email is spam or not. The technique we’ll use to create this cool project is called Text Classification. The group of algorithms that we’ll cover and use is Naive Bayes. Using term frequency and inverse document frequency we’ll be able to tweak our AI for an improved accuracy. To build our AI we’ll use the publicly available Enron dataset. Our tutorial blogs: GameDev Academy: https://gamedevacademy.org HTML5 Hive: https://html5hive.org Android Kennel: https://androidkennel.org Swift Ludus: https://swiftludus.org De Idea A App: https://deideaaapp.org Twitter: @ZenvaTweets
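As a rough idea of how Naive Bayes classifies a message, here is a minimal Python sketch. It is not the course's code: the four toy messages stand in for the Enron dataset, the function names are hypothetical, and it uses plain word counts with add-one (Laplace) smoothing rather than the TF-IDF weighting mentioned above.

```python
import math
from collections import Counter, defaultdict


def train_naive_bayes(messages, labels):
    # Count words per class and how many messages each class has.
    word_counts = defaultdict(Counter)
    class_counts = Counter(labels)
    for text, label in zip(messages, labels):
        word_counts[label].update(text.lower().split())
    vocabulary = {w for counts in word_counts.values() for w in counts}
    return word_counts, class_counts, vocabulary


def predict(model, text):
    word_counts, class_counts, vocabulary = model
    total_messages = sum(class_counts.values())
    best_label, best_score = None, float("-inf")
    for label in class_counts:
        # log P(class) + sum of log P(word | class), with add-one smoothing.
        score = math.log(class_counts[label] / total_messages)
        total_words = sum(word_counts[label].values())
        for word in text.lower().split():
            if word in vocabulary:  # ignore words never seen in training
                score += math.log(
                    (word_counts[label][word] + 1)
                    / (total_words + len(vocabulary))
                )
        if score > best_score:
            best_label, best_score = label, score
    return best_label


messages = [
    "win free money now",
    "free prize win",
    "meeting at noon",
    "lunch meeting tomorrow",
]
labels = ["spam", "spam", "ham", "ham"]
model = train_naive_bayes(messages, labels)
```

Calling `predict(model, "free money")` scores the message under each class and returns the class with the higher log-probability; working in log space avoids underflow when many small probabilities are multiplied.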
Sargur Srihari Basic  14 of 26
Measurement Science and Standards in Forensic Handwriting Analysis Conference 14 of 26
How Random Forest algorithm works
In this video I explain very briefly how the Random Forest algorithm works, with a simple example composed of 4 decision trees. The presentation is available at: https://prezi.com/905bwnaa7dva/?utm_campaign=share&utm_medium=copy
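The voting step of a forest with 4 trees can be sketched as follows. This is not the example from the presentation: the four "trees" are hand-written rules over invented features (`links`, `caps_ratio`, `length`), whereas in a real random forest each tree would be learned from a bootstrap sample of the data using a random subset of features.

```python
from collections import Counter


def majority_vote(trees, sample):
    # Each tree predicts a class; the forest returns the most common prediction.
    votes = Counter(tree(sample) for tree in trees)
    return votes.most_common(1)[0][0]


# Four hypothetical, hand-written "trees" used only to show the voting.
trees = [
    lambda s: "spam" if s["links"] > 2 else "ham",
    lambda s: "spam" if s["caps_ratio"] > 0.5 else "ham",
    lambda s: "spam" if s["links"] > 0 and s["length"] < 50 else "ham",
    lambda s: "spam" if s["length"] < 20 else "ham",
]

sample = {"links": 3, "caps_ratio": 0.7, "length": 40}
prediction = majority_vote(trees, sample)
```

Here three trees vote "spam" and one votes "ham", so the forest's answer follows the majority even though one tree disagrees.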
Applied Machine Learning, Part 1: Feature Engineering
Explore how to perform feature engineering, a technique for transforming raw data into features that are suitable for a machine learning algorithm. - MATLAB for Machine Learning: https://bit.ly/2tUPS0O - Try it now in your browser: https://bit.ly/2IS82KT Feature engineering starts with your best guess about what features might influence the action you’re trying to predict. After that, it’s an iterative process where you create new features, add them to your model, and see if your results have improved.   This video provides a high-level overview of the topic, and it uses several examples to illustrate basic principles behind feature engineering and established ways for extracting features from signals, text, and images. -------------------------------------------------------------------------------------------------------- Get a free product Trial: https://goo.gl/ZHFb5u Learn more about MATLAB: https://goo.gl/8QV7ZZ Learn more about Simulink: https://goo.gl/nqnbLe See What's new in MATLAB and Simulink: https://goo.gl/pgGtod © 2019 The MathWorks, Inc. MATLAB and Simulink are registered trademarks of The MathWorks, Inc. See www.mathworks.com/trademarks for a list of additional trademarks. Other product or brand names may be trademarks or registered trademarks of their respective holders.
An SMS Text Classification System for UNICEF Uganda
Speaker: Rick Lawrence, Senior Manager, Machine Learning & Decision Analytics at IBM Research U-report is an open-source SMS platform operated by UNICEF Uganda, designed to give community members a voice on issues that impact them. Data received by the system are either SMS responses to a poll conducted by UNICEF or unsolicited reports of problems occurring anywhere within Uganda. There are currently 300,000 U-report participants, and they can send up to 10,000 unsolicited text messages a week. The objective of the program in Uganda is to understand the data in real-time, and have issues addressed by the appropriate department in UNICEF in a timely manner. This talk describes an automated message-understanding and routing system deployed by IBM and UNICEF in Uganda. We discuss a dual-supervision machine learning approach to leverage human-generated labels on both features and text examples, and conclude with a discussion of the societal impact that U-report is driving in Uganda.
Machine Learning with Mahout -Tutorial
The details of the course are available here: https://cloudxlab.com/course/specialization/3/big-data-with-hadoop-and-spark (KnowBigData.com is now CloudxLab.com) Subscribe to our channel for the latest videos - https://www.youtube.com/channel/UC8mJ6DL1Q32UWyJUceoO8Jw This is the introductory session on Machine Learning with Mahout. It clears up a lot of myths and confusion about Machine Learning with Mahout and shows how exactly Mahout helps to build recommendations. This is a part of the Hadoop and Big Data course. Our full Hadoop and Big Data course consists of Introduction to Hadoop and Big Data, HDFS architecture, MapReduce, YARN, Pig Latin, Hive, HBase, Mahout, Zookeeper, Oozie, Flume, Spark and NoSQL, with quizzes and assignments. More details below: - - - - - - - - - - How this works - 1. Our classes are conducted live online by our instructors via webinar or hangout. These are not pre-recorded classes. The instructor delivers the class using presentations, collaborative drawing tools and screenshares. 2. Every class is recorded, complete with the screen and the audio, and uploaded to the Learning Management System, which is accessible to our attendees for life. 3. At the end of each session, assignments are provided which the attendees have to submit in the LMS. The assignments are continuously reviewed by our instructors and teaching assistants. In case we conclude that an attendee requires extra detailing, we schedule extra one-on-one sessions with that attendee. 5. After all sessions are over, we ask for the student's preference for a project. We form teams of 3-4 members and, based on their interests, we assign a project to each team. A project is usually of three weeks duration. If a team has an idea it wants to work on as a project, we screen the idea and the team can work on it, or we assign a project from the industry. 6. 
Based on your performance in Quizzes, Assignments and Projects, we provide a certificate and a LinkedIn recommendation, and we endorse you with tags such as Hadoop and Big Data. - - - - - - - - - - About Our Big Data and Hadoop Course: Our Big Data and Hadoop course is designed to impart the knowledge, skills and hands-on experience required to become a successful Hadoop Developer, Administrator or Tester. Concepts Covered: Big Data, NoSQL, Streaming, Analytics Tools Covered: HDFS, MapReduce, Pig, Hive, HBASE, Zookeeper, Flume, Sqoop, Oozie, Spark, Storm, Mahout - - - - - - - - - - What makes this Course unique - Interactive Classes: More Questions. Less Lectures. Simple explanations of complex topics by industry experts. Hands-on workshops and real-life projects. Quizzes & Assignments. Certificate of Course at the end of the course. A real-life project involving Hadoop. Lifetime access to course content. Cloud Labs™ - Access to the cloud infrastructure if learners don't wish to install Hadoop on their computers - - - - - - - - - - What are the prerequisites to join the Big Data and Hadoop course? To be able to take maximum benefit out of this course, you should have knowledge of the following: 1. Basics of SQL. 2. A know-how of the basics of programming. We will be providing video classes covering the basics of Java and Python. - - - - - - - - - - - - - Why Learn Big Data and Hadoop? Big Data is a collection of massive and complex data sets that are very difficult to manage and process with the existing tools intended for that purpose. Data generation is becoming a more obvious result of our everyday devices becoming cheaper, more powerful, compact and connected. We are generating data all the time, such as by tweeting, using email, using Facebook, uploading photos etc. Similarly, our devices are also connected and are generating data. The result is a gargantuan mass of data that needs to be looked at for informed decision making. 
The only way ahead for organizations is to be able to store and process such large amounts of data, for which they use Big Data platforms like Hadoop. This explains the high demand for Hadoop Developers, Administrators, Testers and Scientists. The other way to measure the demand for Big Data and Hadoop technologies is to look at the number of jobs being posted around the world on these technologies. Also, Big Data features in the top three technology trends in organizations as per Forbes and Gartner. - - - - - - - - - - - - - - Please visit https://cloudxlab.com/course/1/big-data-with-hadoop-spark for more details and upcoming classes. For any queries call us at: +91 (80) 492-022-24 - IN / +1 (412) 568-3901 - US or write us at: [email protected] Post your questions on our forum - https://discuss.cloudxlab.com/
Extreme Classification: A New Paradigm for Ranking & Recommendation
The objective in extreme multi-label classification is to learn a classifier that can automatically tag a data point with the most relevant subset of labels from a large label set. Extreme multi-label classification is an important research problem since not only does it enable the tackling of applications with many labels but it also allows the reformulation of ranking and recommendation problems with certain advantages over existing formulations. Our objective, in this talk, is to develop an extreme multi-label classifier that is faster to train and more accurate at prediction than the state-of-the-art Multi-label Random Forest (MLRF) algorithm [Agrawal et al. WWW 13] and the Label Partitioning for Sub-linear Ranking (LPSR) algorithm [Weston et al. ICML 13]. MLRF and LPSR learn a hierarchy to deal with the large number of labels but optimize task independent measures, such as the Gini index or clustering error, in order to learn the hierarchy. Our proposed FastXML algorithm achieves significantly higher accuracies by directly optimizing an nDCG based ranking loss function. We also develop an alternating minimization algorithm for efficiently optimizing the proposed formulation. Experiments reveal that FastXML can be trained on problems with more than a million labels on a standard desktop in eight hours using a single core and in an hour using multiple cores.
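For readers unfamiliar with nDCG, the ranking measure mentioned above, the following Python sketch computes it for a ranked list of relevance labels. It illustrates the metric only, not the FastXML loss or its optimization; the gain form `2**rel - 1` is one common definition (for 0/1 labels it equals the label itself), and the function names are assumptions.

```python
import math


def dcg_at_k(relevances, k):
    # Discounted cumulative gain: items lower in the ranking contribute less.
    return sum(
        (2 ** rel - 1) / math.log2(rank + 2)
        for rank, rel in enumerate(relevances[:k])
    )


def ndcg_at_k(relevances, k):
    # Normalize by the DCG of the ideal (descending-relevance) ordering.
    ideal = dcg_at_k(sorted(relevances, reverse=True), k)
    return dcg_at_k(relevances, k) / ideal if ideal > 0 else 0.0


perfect = ndcg_at_k([1, 1, 0], 3)    # relevant labels ranked first
imperfect = ndcg_at_k([1, 0, 1], 3)  # one relevant label ranked too low
```

A ranking that places all relevant labels first scores 1.0, and every relevant label pushed further down the list lowers the score, which is why the measure suits tasks where only the top few predicted labels matter.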
Identifying Suspicious URLs: An Application of Large-Scale Online Learning
Google Tech Talk May 5, 2010 ABSTRACT Presented by Justin Ma. We explore online learning approaches for detecting malicious Web sites (those involved in criminal scams) using lexical and host-based features of the associated URLs. We show that this application is particularly appropriate for online algorithms as the size of the training data is larger than can be efficiently processed in batch and because the distribution of features that typify malicious URLs is changing continuously. Using a real-time system we developed for gathering URL features, combined with a real-time source of labeled URLs from a large Web mail provider, we demonstrate that recently-developed online algorithms can be as accurate as batch techniques, achieving daily classification accuracies up to 99% over a balanced data set. Slides: http://cseweb.ucsd.edu/~jtma/google_talk/jtma-google10.pdf Justin Ma is a PhD candidate at UC San Diego advised by Stefan Savage, Geoff Voelker and Lawrence Saul. His research interests are in systems and networking with an emphasis on network security, and his current focus is the application of machine learning to problems in security. He will be joining UC Berkeley as a postdoc after graduation. [Home page: http://www.cs.ucsd.edu/~jtma/ ]
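The idea of learning from labeled URLs one at a time can be sketched with a simple online perceptron over lexical features. This is a stand-in chosen for brevity, not necessarily one of the algorithms evaluated in the talk; the URLs are invented, host-based features are left out, and all names are hypothetical.

```python
import re


def lexical_features(url):
    # Bag of tokens from the URL text, split on common delimiters.
    return set(re.split(r"[/.?=&:_-]+", url.lower())) - {""}


class OnlinePerceptron:
    def __init__(self):
        self.weights = {}

    def predict(self, features):
        score = sum(self.weights.get(f, 0.0) for f in features)
        return 1 if score > 0 else -1  # 1 = malicious, -1 = benign

    def update(self, features, label):
        # Learn from one example at a time; weights change only on a mistake.
        if self.predict(features) != label:
            for f in features:
                self.weights[f] = self.weights.get(f, 0.0) + label


# Invented stream of (URL, label) pairs arriving in order.
stream = [
    ("http://paypal.secure-login.example.ru/verify", 1),
    ("http://example.org/docs/index.html", -1),
    ("http://secure-login.example.ru/update", 1),
    ("http://example.org/about", -1),
]

model = OnlinePerceptron()
mistakes = 0
for url, label in stream:
    features = lexical_features(url)
    if model.predict(features) != label:
        mistakes += 1
    model.update(features, label)
```

Because each example is used once and then discarded, the model never needs the whole training set in memory, and it keeps adapting as the distribution of malicious URLs shifts.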
Introducing Cloud AutoML
Cloud AutoML is a suite of Machine Learning products that enables developers with limited machine learning expertise to train high quality models by leveraging Google’s state of the art transfer learning, and Neural Architecture Search technology. AutoML Vision is the first product to be released. It is a simple, secure and flexible ML service that lets you train custom vision models for your own use cases. Soon, Cloud AutoML will release other services for all other major fields of AI. Product page: https://cloud.google.com/automl/ Blog post: https://www.blog.google/topics/google-cloud/cloud-automl-making-ai-accessible-every-business/ Podcast: https://www.gcppodcast.com/post/episode-109-cloud-automl-vision-with-amy-unruh-and-sara-robinson/ Using Distributed TensorFlow with Cloud ML Engine and Cloud Datalab → http://bit.ly/2ILCe5b
C4.5 algorithm
C4.5 is an algorithm used to generate a decision tree developed by Ross Quinlan. C4.5 is an extension of Quinlan's earlier ID3 algorithm. The decision trees generated by C4.5 can be used for classification, and for this reason, C4.5 is often referred to as a statistical classifier. This video is targeted to blind users. Attribution: Article text available under CC-BY-SA Creative Commons image source in video
Predicting Phishing/Non Phishing Website Detection using Machine Learning Algorithm using Matlab
Our method detects phishing websites using features available in the URL and page contents, without relying on search engines such as Google. Our methodology aims to extract as many of the features that exist in the literature as possible, and then to find the robust features that are not affected by concept drift. This answers the question: are there features that can give the required accuracy when the training and testing data come from different times, given that phishers change their tactics from time to time? We find such features using machine learning algorithms such as Genetic Algorithm, Ant Colony Optimization (ACO) and Particle Swarm Optimization (PSO), and then apply classifiers using Artificial Neural Network (ANN), Support Vector Machine (SVM) and the Treefit Algorithm to decide which one gives the best performance. The performance analysis is done by software simulation in Matlab, using measures such as Accuracy, Sensitivity and Selectivity and all related parameters.
Machine Learning | 2 (in Arabic)
Arthur's and Tom Mitchell's definitions of Machine Learning, how a machine learns, and concepts related to the types of machine learning.
Collections as Data: Stewardship and Use Models to Enhance Access
The rise of accessible digital collections coupled with the development of tools for processing and analyzing data has enabled researchers to create new models of scholarship and inquiry. The National Digital Initiatives team invites leaders and experts from organizations that are collecting, preserving and providing researcher access to digital collections as data to share best practices and lessons learned. This event will also highlight new collaborative initiatives at the Library of Congress that seek to enhance researcher engagement and the use of digital collections as data. Hashtag: #AsData Schedule: http://digitalpreservation.gov/meetings/dcs16.html
12 week Data Science Bootcamp Students Machine Learning project demo Hillary Clinton Emails Explorer
Project Description: Throughout 2015, Hillary Clinton has been embroiled in controversy over the use of personal email accounts on non-government servers during her time as the United States Secretary of State. Thanks to the Freedom of Information Act, on Monday, August 31, 2015, the State Department released nearly 7,000 pages of Clinton's heavily redacted emails. John Montroy, Jake Lehrhoff and Chris Neimeth took those emails and wrangled them for your exploring pleasure. During our presentation they will cover the tasks involved with munging and analyzing this data, including NLTK, sentiment analysis, MySQL, Python, Flask, an AWS instance and lots of elbow grease. The project output includes: - A dashboard to enable exploration of Hillary’s emails - Displays of topics by sender and recipient - Sentiment analysis of emails Speaker Bio: Chris Neimeth is a serial entrepreneur in the technology, media and entertainment businesses. Chris has served in various strategic roles: CEO of Salon Media Group Inc., President of IAC Partner Marketing, Executive Vice President of Ticketmaster, President/CEO of Real Media, Chief Commercial Officer of Daylife, Senior Vice President for The New York Times Company Digital, and founder of Grey Interactive. He has twice served as a member of the Aspen Institute Forum on Communication and Society, and is a two-time elected Director of the Interactive Advertising Bureau. Projects: http://blog.nycdatascience.com/uncategorized/mass-shootings-in-america/ Jake Lehrhoff is a man of many hats. For six years he taught middle school English and chaired the department at a school for children with moderate-to-severe emotional and behavioral disorders. He developed a system of intradepartmental supervision to monitor the efficiency and effectiveness of the billing department of a rheumatology laboratory. He wrote a novel about an autistic boy and edited the memoir of a triathlete. 
Jake holds a BA in psychology from Wesleyan University and an MA in psychology from Brandeis University, where he studied quantitative research methods and statistics and graduated with a perfect GPA. Jake takes great satisfaction in solving problems and is excited to apply his knowledge of machine learning and skills in R and Python to tackle new challenges. Blog: http://blog.nycdatascience.com/author/jake.lehrhoff/ John Montroy is a graduate of Middlebury College with a B.A. in Physics. After a summer of particle physics at CERN with the Harvard ATLAS team, he began his career as a data analyst in the auto industry. He has been programming since the age of 12, and delights in clean, re-usable, and functionally-oriented code. A self-starter and curious thinker, his interests run the gamut from mathematics to classical music. In his spare time, he can be found playing piano or mandolin, singing barbershop, and running. github: https://github.com/jmontroy90/teamhrc blog: http://blog.nycdatascience.com/author/jmontroy90/
