databricks-hackathon-2023.devpost.com Open in urlscan Pro
54.145.20.225  Public Scan

Submitted URL: http://bit.ly/sytychi
Effective URL: https://databricks-hackathon-2023.devpost.com/?utm_source=devpost&utm_medium=email&utm_campaign=sytych23&utm_content=influencer
Submission: On June 13 via api from SG — Scanned from SG

Form analysis 2 forms found in the DOM

GET https://devpost.com/hackathons

<form class="search-bar-form flex-row align-center" action="https://devpost.com/hackathons" accept-charset="UTF-8" method="get"><input name="utf8" type="hidden" value="✓">
  <a id="collapse-mobile-search" data-toggle-mobile-search=""><i class="fas fa-arrow-left" aria-hidden="true"></i></a>
  <input type="search" name="search" id="search" title="Search" placeholder="Search hackathons">
  <button type="submit" class="button postfix dark">
    <i class="fas fa-search" aria-hidden="true"></i>
  </button>
</form>

GET https://devpost.com/hackathons

<form class="search-bar-form large flex-row" action="https://devpost.com/hackathons" accept-charset="UTF-8" method="get"><input name="utf8" type="hidden" value="✓">
  <input type="search" name="search" id="search" title="Search" placeholder="Search hackathons">
  <button type="submit" class="button postfix dark">
    <i class="fas fa-search" aria-hidden="true"></i>
  </button>
</form>

Text Content

 * 
 * 
 * 
 * * Log in
   * Sign up
 * 


 * 
 * Hackathons
 * 
 * Projects
 * 
 * Host a hackathon

 * 

 * 
 * Hackathons
 * 
 * Projects
 * 

 * Host a hackathon

 * Log in
 * Sign up




SO YOU THINK YOU CAN HACK

Deadline: Jun 16, 2023 @ 5:00pm PDT
Join hackathon

Descend
 * Overview
 * My projects
 * Participants (715)
 * Resources
 * Rules
 * Project gallery
 * Updates
 * Discussions


SO YOU THINK YOU CAN HACK


JOIN US FOR A HACKATHON TO SHOWCASE OPEN-SOURCE LLMS (E.G., OPENASSISTANT, MPT,
DOLLY, ETC.) AND OR SPARK CONNECT.

Join hackathon

WHO CAN PARTICIPATE

 * Above legal age of majority in country of residence
 * All countries/territories, excluding standard exceptions

View full rules

4 more days to deadline

View schedule

Deadline: Jun 16, 2023 @ 5:00pm PDT
 * Apple
 * Google
 * Outlook

Online
Public
$20,000 in prizes 715 participants

Databricks
Machine Learning/AI Databases
Managed by Devpost

Join forces with data scientists, engineers, and analysts. As a pre-cursor to
the upcoming Data + AI summit, we invite you to create unique and novel
applications, use cases, and/or techniques to showcase open-source LLM models
(e.g., OpenAssistant, MPT, Dolly, etc.) and/or Spark Connect. While we can't get
enough of Dolly, we also suggest you check out LangChain, PandasAI, and vector
databases.

Three finalist teams and five honorable mention awards will be selected and
announced during the Data+ AI Summit 2023 keynote, and you will have the
opportunity to win a cash prize to be split among your team. 

GET STARTED

We encourage you to create your own novel application or use case using the
following resources:



LLM LEARNING RESOURCES:



• Hugging Face Open Assistant:
     ▹ https://open-assistant.io/
     ▹ https://huggingface.co/OpenAssistant
• Mosaic ML MPT-7B:
     ▹ https://www.mosaicml.com/blog/mpt-7b
     ▹ https://github.com/mosaicml/llm-foundry
     ▹ mosaicml/mpt-7b · Hugging Face
• Databricks Lab Dolly GitHub repo: https://github.com/databrickslabs/dolly
     ▹ Hugging Face > Databricks
     ▹ Free Dolly: Introducing the World's First Truly Open Instruction-Tuned
LLM
     ▹ Hello Dolly: Democratizing the magic of ChatGPT with open models

SPARK CONNECT LEARNING RESOURCES:



• Reference documentation
     ▹ Spark Connect Overview
     ▹ Spark Connect Quick Start
• Blogs
     ▹ Introducing Spark Connect - The Power of Apache Spark, Everywhere
     ▹ Spark Connect Available in Apache Spark 3.4

POTENTIAL PROJECT IDEAS:

• Work with parsing audio transcriptions such as OpenAI's Whisper
• Create AI Agents with compute and search capabilities (LangChain is a great
place to work on these kinds of tools)
• Build a QA bot with vector databases leveraging similarity searching
• Tune a DLite or Dolly model using Databricks.

The judges will assess your projects' performance with a standard battery of
diverse prompts while accounting for quantitative metrics like latency.

To get started, you can use the following example notebooks as your guide:



• Build your Chat Bot with Dolly
• AI Functions: query LLM with DBSQL

We encourage you to use open-source models and datasets such as (but not limited
to):
• Dolly 15K dataset
• Red Pajama dataset
• OpenAssistant Conversations dataset (OASST1)
• LongForm dataset
• Alpaca Libra dataset
• Eleuther.AI datasets
• Fun beginner-friendly datasets on Kaggle
• Hugging Face instruct_me dataset (highly rated, general purpose open-source,
Apache v2)

Additional Guidance



If you are building LLM applications, we’d recommend tools like LangChain,
Pandas AI, and vector databases




REQUIREMENTS

Step 1. BUILD YOUR TEAM.  Your team can have up to 4 participants and must be
registered here on Devpost as participating in the hackathon.

Step 2. Create a new application using an open-source large language model (LLM)
like OpenAssistant, MPT, Dolly, or others, or create a new one using Spark
Connect. Your application must have been created after May 18th, and all work
must be completed during the hackathon timeframe. Create a compelling project
that showcases open LLM models in new and useful way. It is preferred, but not
required, to showcase these use cases within a Databricks or Jupyter notebook. 

Step 3. Record a video screencast (<= 280 seconds) demonstrating the
application, providing commentary answering the following questions:

 * Why did you choose this topic?
 * Which open-source LLM and any additional open-source datasets did you use?
   Explain why.
 * Or what is your Spark Connect application and any open-source datasets did
   you use?  Explain why.
 * How does your project provide relevant and insightful information to the end
   user?

Step 4. Complete your submission on Devpost before 5 PM PT on June 16th. This
includes a project description with the following:

 * Your hosted video
 * A URL to your application open-source source code. Your GitHub repository
   should not have any contributions before May 18th and include an open-source
   license.
 * Invite all of your teammates and make sure they accept it.


HACKATHON SPONSORS




PRIZES

$20,000 in prizes


GRAND PRIZE WINNING TEAM

• $10,000 USD
• Project Highlight at Data + AI Summit

2ND PLACE WINNING TEAM

• $5,000 USD
• Project Highlight at Data + AI Summit

3RD PLACE WINNING TEAM

• $2,500 USD
• Project Highlight at Data + AI Summit

HONORABLE MENTIONS (5)

• $500 USD


DEVPOST ACHIEVEMENTS

Submitting to this hackathon could earn you:




JUDGES

Mike Conover
Staff Software Engineer, Databricks

Stefania Leone
Sr. Manager, Product Management, Databricks

Martin Grund
Senior Staff Software Engineer, Databricks

Conor B. Murphy
Sr. Data Science Manager, Databricks

Benjamin Harvey, Ph.D.
Founder and CEOFounder & CEO, AI Squared

Sean Owen
Principal Specialist for Data Science and ML, Databricks


JUDGING CRITERIA

 * Creativity
   Is this a new and original idea, or has this been done before?
 * Relevance
   How have you combined relevant and interesting datasets and tools?
 * Thoroughness
   Is your application easy for the end user to understand? Does it provide
   relevant and insightful information?
 * Quality of submission
   How well-written and organized are your description, video explanation, and
   any provided visual presentation?

Questions? Email the hackathon manager

Invite others to compete

 * 
 * 
 * 


HACKATHON SPONSORS

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of
Service apply.

DEVPOST

 * About
 * Careers
 * Contact
 * Help

HACKATHONS

 * Browse hackathons
 * Explore projects
 * Host a hackathon
 * Hackathon guides

PORTFOLIO

 * Your projects
 * Your hackathons
 * Settings

CONNECT

 * 
   Twitter
 * 
   Discord
 * 
   Facebook
 * 
   YouTube

© 2023 Devpost, Inc. All rights reserved.
 * Community guidelines
 * Security
 * CA notice
 * Privacy policy
 * Terms of service