May 18, 2016
San Francisco, CA Buy Tickets
The future is already here, it is just not evenly distributed. But it clearly shows in our 150 talks, comprising 7 conferences, bounded by the 5 days conference matrix. 50+ founders/CEOs/CTOs speaking.
In-depth talks from Google (BigQuery and Translate), Baidu Research, MetaMind, StitchFix (Deep Learning), Microsoft, Bloomberg, Quora, Kaggle, Dato (Machine Learning), Netflix (Recommender Systems), IBM (Watson), Facebook, ClearStory (DataViz), LinkedIn, Yahoo, H2O, Confluent, Mesosphere (Data Pipelines), Samsung, Automatic (IoT), AMPLab, Databricks, Salesforce, Workday, Cloudera (Spark), Pivotal (OSS), Zillow, Pandora, Nitro, Lucidworks, Mattermark, Credit Karma, Alpine Labs, , University of California-Berkeley, Stanford University, City of San Francisco, and many others.
Buy TicketsOnly 300 tickets for each day will be available to have a truly intimate technical community atmosphere.
Law By the Bay is a focused extension of Text By the Bay, the first applied legal data/NLP conference. The key idea is to take open-source tools developed by the best researchers and practitioners, working at scale, and build a community of legal-centric startup users understanding and improving them to run a business. Scientific rigor and excellent software engineering are the two key properties of the systems we use. Law By the Bay directly follows Text By the Bay, and runs on a parallel track with Data for Democracy.
Brought to you by the organizers of SF Text, SF Scala, SF Spark, Reactive Systems,
Text By the Bay 2015, Big Data Scala 2015, and Scala By the Bay 2013-2015.
Conference News
Registration is open
Schedule is Published
Law By the Bay: Legal Informatics
A growing applied NLP/search/data-centric conference bringing together legal professionals, startup engineers, researchers, and entrepreneurs, using text mining to build new companies through understanding of business and legal data. EVery business document should have clear meaning as it may have legal, contractual, and complicance ramifications. Although much of ecommerce moved online, business workflows exist within private document-centric ecosystems. In every company there's a "document balck market" where documents are shared laterally among the employees without explicit governance and machine understanding helping the company understand where information is. We bring the same successful approaches powering search and NLP on the Web to business document ecosystem.
Please see the umbrella Data By the Bay description of a good talk By the Bay.
For Law By the Bay, some specific topics of interest include those covered in Text By the Bay 2015 and beyond and applying to legal domain, such as
- Open-source libraries for parsing, entity linking, etc., at scale
- Public corpora, crowdsourcing, labels, AIAI platforms, human-computer systems
- Semantic modeling, knowledge bases, ontologies, legal search
- Personalized legal NLP -- e.g. understanding individual judges from their output
- Deep Learning on legal corpora
- And more!
Last year, we started with a two-day, three-track, 50-talk conference. We've put together an inspiring program centered around language, Big Data, text and images, deep learning, UI, social networks, and much more.
This year, we're running the first data grid conference sequence with with seven verticals over five days. Each day's attendance is limited to only 400 seats and it will be full. We hope you join us in May By the Bay!
Keynote Speakers
Watch this space for the inspiring talks by leaders in each area
Data Pipelines By the Bay
May 16, 2016
Building on Big Data Scala, this is the first conference showing end-to-end unity of Data Engineering and Data Science for big, fast, streaming data.
Text By the Bay
May 17-18, 2016 (Day 2 parallel with Democracy By the Bay and Law By the Bay)
The first applied NLP conference for the Bay Area, building on the highly-acclaimed 2015 edition: 50 talks from 50 top companies, all online at functional.tv.
Democracy By the Bay
May 18, 2016 (parallel with Law By the Bay and Text By the Bay)
NLP and Data Science with focus on politics, society, and government.
Law By the Bay
May 18, 2016 (parallel with Democracy By the Bay and Text By the Bay)
NLP and Data Science with focus on legal data and processes.
Legal search (100% recall), case-specific NLP, ambiguity analysis, etc.
AIoT By the Bay
May 19, 2016
Not everything is text. Multiple talks at Text By the Bay dealt with multi-modal data such as images with text. AI and IoT day is all about sensor data streams, images, vision, speech, music.
Life Sciences By the Bay
May 20, 2016 (Parallel with Data UX By the Bay)
There are several major categories of data mining related to life and health. First, genomics -- Bay Area leads with Spark and ADAM. Second, medical sensor and imaging data, with companies like Enlitic.
Data UX By the Bay
May 20, 2016 (Parallel with Life Sciences By the Bay)
Data should be visualized, with massive datasets distilled into clear and actionable display calling attention to what's really important. And then UX should naturally lead to the appropriate action.
Data By the Bay – Common Thread
May 16-20, 2016
For each conference, we'll have a common horizontal themes: platforms and algorithms.
Our Sponsors
Friend Sponsors
Media Sponsors
Technology Insights and Events
Be a supporting member of San Francisco's premier Data/AI conference. We want to hear from you! Contact us for a prospectus and sponsorship agreement, or to talk about how we can help you be a contributing sponsor for the Data By The Bay conference!

The Agenda
Come to Text By the Bay well-rested and ready to meet your fellow developers. We'll have a full day of talks (keynotes, full-length, and lightning) and build a startup-centric data engineering community for the Bay Area!
Get Updates
Stay informed with the Text By the Bay conference news and event updates.
If you'd like to sponsor Text By the Bay, contact sponsors@bythebay.io
Map
Conference Schedule
Conference Tickets
You can buy tickets for two or more days of the conference as passes. Once you buy a pass, you will receive an email with instructions on how to redeem the days you want. Each day has the capacity of 400 and will automatically be disabled once full. We'll add the days that are sold out on the TICKETS page as soon as they become unavailable.
Currently available days: Day 1, Day 2, Day 3, Day 4, Day 5.
Pricing works as follows: regular admission is $500/day. Very Early Bird is $400/day, Early Bird is $450/day, and late Bird is $550/day. We will only allocate 100 Very Early/Early Bird tickets for each day, since our capacity is limited and the word is only getting out. The passes are 2/3/4/5-day bundles, discounted $50 per each extra day (so 2-day Very Early Bird Bundle is $750, 2-day Early Bird Bundle is $850, 2-day Regular Admission Bundle is $950, etc.). We use Stripe directly to process all payments.
Full-time students inquiring about discounts: please email proof of enrollment and dates of interest.