Code from Kaggle Data Mining Competition Winners and Top Finishers

2023-12-28 12:32


Original article: http://www.chioka.in/kaggle-competition-solutions/

We learn more from code, and most of all from great code. That does not always mean the first-place solution, because comparing solutions also shows what separates a stellar solution from a merely good one. I will post the solutions I come across so we can all learn to become better!

I collected the following source code and interesting discussions from competitions held on Kaggle, for learning purposes. Not all competitions are listed: I collect the links manually, and for some competitions no one shared a solution. I will add more as time goes by. Thank you.

Algorithmic Trading Challenge

  • Solution whitepaper.
  • Solution thread.

Allstate Purchase Prediction Challenge

  • Rank 2 solution code by Alessandro Mariani.
  • Rank 10 solution code by B1aine.
  • Rank 36 solution code by Hiroyuki.
  • Rank 159 solution code by MrCanard.
  • Solution thread.

Amazon.com – Employee Access Challenge

  • Rank 1 solution code by Paul Duan and Benjamin Solecki team.
  • Rank 1 solution Q&A by Paul Duan.
  • Rank 2 solution code by Owen Zhang.
  • Rank 3 solution code by Dmitry & Leustagos.
  • Rank 289 solution code by Foxtrot with original blog post here.
  • Solution thread.

AMS 2013-2014 Solar Energy Prediction Contest

  • Rank 1 solution code and description by Leustagos team.
  • Rank 2 solution code and description by Toulouse.
  • Rank 3 solution code and description by Owen Zhang.
  • Rank 4 solution description by Peter Prettenhofer.
  • Rank 5 solution description by Domcastro.
  • Rank 58 solution code and description by Davit.
  • Solution thread here.
  • Ridge Regression starter code with MAE about 2.2M by Alec Radford, original thread here.
  • Improved starter code by Foxtrot.
  • Baseline code with MAE about 2.6M using Catmull-Rom Spline interpolation, also available in R here and here.
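
For readers unfamiliar with the two terms in the baseline entry above, here is a minimal sketch of Catmull-Rom spline interpolation and of the mean absolute error (MAE) metric. It is not the code from the linked baseline: it assumes the uniform Catmull-Rom variant, and the function names and sample values are hypothetical, chosen only for illustration.

```python
def catmull_rom(p0, p1, p2, p3, t):
    """Uniform Catmull-Rom spline (assumed variant).

    Interpolates between p1 and p2 for t in [0, 1]; p0 and p3 are the
    neighbouring control points that shape the curve.
    """
    t2 = t * t
    t3 = t2 * t
    return 0.5 * (
        2.0 * p1
        + (p2 - p0) * t
        + (2.0 * p0 - 5.0 * p1 + 4.0 * p2 - p3) * t2
        + (3.0 * p1 - p0 - 3.0 * p2 + p3) * t3
    )


def mean_absolute_error(actual, predicted):
    """Average of |actual - predicted| over paired observations."""
    if len(actual) != len(predicted):
        raise ValueError("actual and predicted must have the same length")
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


# Hypothetical coarse-grid values (not competition data).
coarse = [0.0, 1.0, 4.0, 9.0, 16.0]

# Estimate the value halfway between coarse[1] and coarse[2].
midpoint = catmull_rom(coarse[0], coarse[1], coarse[2], coarse[3], 0.5)

# Score a hypothetical set of predictions against hypothetical targets.
score = mean_absolute_error([1.0, 2.0, 3.0], [2.0, 2.0, 5.0])
```

The spline passes exactly through its control points (t = 0 returns p1, t = 1 returns p2), which is why it suits filling in values between known grid points. MAE is reported in the same units as the target, so the figures quoted in the list above are in the target's units.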

Belkin Energy Disaggregation Competition

  • Solution thread.

Challenges in Representation Learning: Facial Expression Recognition Challenge

  • Rank 1 solution code and description by Charlie Tang.
  • Rank 3 solution description by Maxim Milakov.
  • Solution thread.

Challenges in Representation Learning: The Black Box Learning Challenge

  • Rank 1 solution description by David Thaler.
  • Rank 2 solution code and description by sayit.

Challenges in Representation Learning: Multi-modal Learning

  • Rank 1 solution by MMDL.
  • Solution thread.

Detecting Insults in Social Commentary

  • Rank 1 solution description and code by Vivek Sharma.
  • Rank 2 solution by tuzzeg.
  • Rank 3 solution description by Andrei Olariu.
  • Rank 4 solution by Chris Brew.
  • Rank 5 solution description by Yasser Tabandeh.
  • Rank 6 solution by Andreas Mueller, code available here.
  • Rank 8 solution description by Steve Poulson.
  • Solution thread.

EMI Music Data Science Hackathon

  • Rank 4 solution description by Steffen Rindle.
  • Rank 18 solution code and description by Vlad Gusev.
  • Rank 34 solution code and description by zenog.
  • Solution thread.

Galaxy Zoo – The Galaxy Challenge

  • Rank 1 solution code and description by Sander Dieleman.
  • Rank 2 solution code and description by Maxim Milakov.
  • Rank 3 solution code and description by tund.
  • Rank 5 solution code and description by Julian de Wit.
  • Rank 9 solution code and description by Soumith Chintala.
  • Rank 13 solution code and description by Xiaoxiang Zhang.
  • Rank 28 solution code and description by utdiscant.
  • Rank 38 solution code and description by sugi.
  • Rank 57 solution code and description by hxu.
  • Rank 58 solution code and description by yr.
  • Solution thread.

Global Energy Forecasting Competition 2012 – Wind Forecasting

  • Rank 1 solution by Leustagos.
  • Solution thread here.

KDD Cup 2013 – Author-Paper Identification Challenge (Track 1)

  • Rank 1 solution with code and description by Team Algorithm; GitHub link to code here.

KDD Cup 2013 – Author Disambiguation Challenge (Track 2)

  • Rank 1 solution with code and description by Team Algorithm; GitHub link to code here.
  • Rank 2 solution by SmallData Team.
  • Rank 3 solution by hustmonk.
  • Rank 4 solution by Ben S.
  • Solution thread.

Large Scale Hierarchical Text Classification

  • Rank 1 solution code and description by anttip.
  • Rank 3 solution code and description by nagadomi.
  • Solution thread one.
  • Solution thread two.

Loan Default Prediction – Imperial College London

  • Rank 2 solution and description by HelloWorld.
  • Rank 12 solution and description by David McGarry.
  • Solution thread.

Merck Molecular Activity Challenge

  • Ideas sharing discussion thread.
  • Preprocessing techniques discussion thread.

MLSP 2013 Bird Classification Challenge

  • Rank 1 solution code and description by beluga.
  • Rank 2 solution code and description by Herbal Candy (W and thomeou).
  • Rank 3 solution description by Anil Thomas.
  • Rank 4 solution description by Maxim Milakov.
  • Solution thread.

Observing the Dark World

  • Rank 2 solution by Iain Murray, code available here.

PAKDD 2014 – ASUS Malfunctional Components Prediction

  • Yet another solution thread.
  • Solution thread.

Personalize Expedia Hotel Searches – ICDM 2013

  • Presentation paper/slides for ICDM 2013.
  • Solution thread.

Predicting a Biological Response

  • Rank 6 solution by Shea Parkes & Neil Schneider team.
  • Rank 17 solution, an ensemble of RandomForests, GradientBoostingTrees and ExtraTreesRegressor, by Emanuele Olivetti.
  • Another solution code using Oblique Random Forest (oRF) by Shea Parkes & Neil Schneider team.
  • "The code of my best submission" thread, which discusses multi-core training of Oblique Random Forests and stacking.
  • "Question about the process of ensemble learning" thread, which discusses applying ensembles in practice, how problems can arise, and how to deal with them.

Predicting Closed Questions on Stack Overflow

  • Rank 10 solution by Marco Lui.
  • Rank 33 solution by Foxtrot.

See Click Predict Fix

  • Solution thread.

See Click Predict Fix – Hackathon

  • Solution thread.

StumbleUpon Evergreen Classification Challenge

  • Benchmark beater 1.
  • Benchmark beater 2.
  • Benchmark beater 3.
  • Solution thread.
  • My own solution, which is a good example of overfitting. (Public rank: 57, Private rank: 291)

The Analytics Edge (15.071x)

  • Rank 17 solution code and description by Foxtrot.
  • Solution thread.

The Marinexplore and Cornell University Whale Detection Challenge

  • Rank 1 solution by Nick Kridler.
  • Rank 7 solution by Gilles Louppe and Peter Prettenhofer team.
  • Rank 8 solution by Sander Dieleman.
  • Rank 56 solution by Sudeep Juvekar.
  • Solution discussion thread.
  • Mean spectrogram thread.
  • Official interview from Marinexplore and Cornell at Kaggle.

Walmart Recruiting – Store Sales Forecasting

  • Rank 1 solution code and description by David Thaler.
  • Rank 2 solution description by sriok.
  • Rank 3 solution code and description by James King.
  • Rank 5 solution description by ACS69.
  • Rank 6 solution description by T. Henry.
  • Rank 8 solution description by BreakfastPirate.
  • Rank 9 solution description by Neil Summers.
  • Rank 10 solution description by Gilberto Titericz Junior.
  • Rank 11 solution description by citynight.
  • Rank 16 solution code and description by yr.
  • Rank 29 solution code and description by Mike Kim.
  • Rank 30 solution description by dkay.
  • Solution thread.

Thank you to Foxtrot, James Petterson, and Ben S for providing some of the links and solutions above.
