Brussels, Belgium
Tuesday 2 - Friday 5 April 2024
B-Cubed’s hackathon was a 4-day event, bringing together biodiversity informaticians, researchers, and practitioners who are passionate about leveraging biodiversity data for impactful solutions. Our common goal was to standardise biodiversity data in order to enhance efficiency and accessibility. The main idea was to experiment with data cubes and channel creativity into innovative solutions for a variety of biodiversity challenges.
We strongly encouraged students to participate, bringing their fresh perspectives and knowledge to engage with data cubes and connect with leading experts in the field. The international B-Cubed hackathon acted as an opportunity to push boundaries and experiment with data cubes, while tackling an array of challenges.
Eleven innovative projects have been selected for
B-Cubed’s hackathon!
A presentation at the start of the competition introduced participants to the concept of data cubes. Throughout the competition, they also benefited from support provided by a panel of experts, available for guidance for up to an hour each day. Additionally, there were many opportunities to attend interesting presentations.
In teams, transformative ideas were presented to a jury panel composed of B-Cubed’s sister projects. The top three teams that excelled in efficiently integrating cubes, developing innovative solutions to address obstacles, and effectively visualising their work for policy makers were declared the winners of the Hackathon. Furthermore, the contributions played a pivotal role in advancing data cube technology, aiding in the enhancement of data preservation and monitoring on a global scale.
Summary
In the Netherlands, collective farmer contracting to make biodiversity and habitat conservation on a landscape level more ecologically efficient, has long been implemented. Multidimensional data cubes can be a great vehicle to integrate individual data across farmlands and monitor conservation efforts on a landscape level. A landscape level approach is crucial for conservation of biodiversity and habitats as farmland birds and other species don’t adhere to the individual farmland level, hence their conservation requires cooperation across farmers.
Needed skills
Python, EBVs, biodiversity monitoring experience in agricultural setting, handling and processing ecological long tail data, species distributions.
Summary
In this project, we aim to create a simulation framework for biodiversity data cubes using the R programming language. This can start from simulating multiple species distributed in a landscape over a temporal scope. In a second phase, the simulation of a variety of observation processes and effort can generate actual occurrence datasets. Based on their (simulated) spatial uncertainty, occurrences can then be designated to a grid to form a data cube. Nevertheless, we encourage you to think out of the box and provide new ideas as well. For the occurrence-to-cube designation, R code is already available.
Needed skills
R programming; Algorithm development; Creative thinking; Problem solving; Efficient coding; Collaboration (Git).
Summary
For the finance and insurance sectors to benefit from opportunities provided by nature and mitigate nature-related risks, reliable biodiversity monitoring data is a must. To encompass the many dimensions of nature and service the Financial Services market, NatureMetrics developed the NatureMetrics digital dashboard which combines biodiversity footprint, biodiversity impact and biodiversity trends metrics. Thus, this framework concomitantly measures biodiversity pressures and responses. This project proposes the incorporation of standardised biodiversity data from B-Cubed within iNPI, for example, during design of open-source connectivity algorithms at high spatial resolution to link with the framework’s restoration tracker element, to maximise the efficiency and return of investment of biodiversity management interventions, and demonstrate the high-resolution required for the Financial Services market.
Needed skills
Data scientists; Bioinformaticians; Statistical ecologists; Mathematical modelling; Machine learning.
Summary
This project explores the added value of biodiversity data cubes to identify critical habitats using Deep-Learning Species Distribution Models (Deep-SDMs). The conventional Deep-SDM approach often relies on observation point data, which may struggle to capture the dynamic temporal evolution patterns of species distributions. The integration of biodiversity data cubes with environmental and remote sensing data will be performed to improve Deep-SDMs and thus better describe habitats. To illustrate this approach, we will explore a use case based on floristic data from France and Belgium. Using detailed floristic data from these two countries, we will demonstrate how the integration of data cubes can contribute to the modelling of species distribution over an extended geographical scale, taking into account environmental and remote sensing data. This case study will serve to highlight the potential benefits of using data cubes in biodiversity modelling.
Needed skills
Knowledge of GBIF data and data cube extraction, data-viz and machine/deep learning algorithms.
Summary
This project will funnel relevant biodiversity databases such as OBIS, GBIF or ETN, and environmental data sources such as Copernicus Marine Service or Bio-Oracle, into a Spatial Temporal Asset Catalog (STAC). This OGC-compliant specification aligns with the GBIF API. The STAC back-end will serve as an open-access entry point of ecological modelling-ready data cubes following the B-Cubed software specification. These cubes will be compatible for plugging into machine learning and modelling workflows. The project will showcase its potential for early risk assessment of non-indigenous species invasions with SDMs to calculate the probability of potential invasive species to colonise an area. It will build upon existing tools and develop new ones using well-known data science programming languages such as python or R, allowing reuse for other marine scientists, policy-makers and the wider community.
Needed skills
Policy and communication.
Summary
Spectral diversity serves as a measure for vegetation and trait diversity of plant communities and their geological surroundings. From a Critical Zone perspective, it considers plant communities’ trait expression, accounting for taxonomic composition and abiotic factors. Emphasising trait response to abiotic factors changes requires considering the temporal dimension and taking a phenological stance. The R package rasterdiv facilitates spectral diversity analysis, in particular with Rao'Q. It enables dealing with multidimensional traits, assuming they are uncorrelated which is not the case for vegetation index time series. This project will implement a method in rasterdiv, allowing it to deal with times series, opening the possibility to correctly use phenological data to explore plant trait biodiversity.
Needed skills
DTW libraries, Phenology Data, algorithms, and resources to obtain them; Rasterdiv package knowledge; Scripting capacity in R/python; Knowledge of use case data & community ecology.
Summary
The project, part of the doctoral thesis "Improving Semantic Interoperability of observational research through data FAIRification," focuses on representing eLTER Standard Observations (SOs) for eLTER PLUS (871128). The SOs will include the minimal variables set and associated method protocols to characterise Earth systems’ state and future trends. The project will semantically enhance variable descriptions in EnvThes with the I-ADOPT Framework, enabling semantic precision and machine-readability. For the B-Cubed hackathon, the project targets biodiversity variables, including flying insects, vegetation composition and occurrence data of birds using acoustic recording observed on the LTER Site “Zöbelboden”. Building on existing specifications like the Species occurrence cube, the project aims to contribute to FAIR, semantically rich representations, ensuring interoperability beyond eLTER requirements.
Needed skills
FAIR data management; Data modelling; Knowledge engineering; Semantic modelling; FAIR metadata schema development; Jupyter Notebook & Python.
Summary
In most cases, there is no complete information about the ‘reality’ of the focal species distribution besides the data collected in-situ. This is partly because the completeness of the data extracted from surveys (recorded in-situ) is difficult to measure. This affects our ability to detect the real spatial coverage as well as the niche of a species with the records available. These limitations, in turn, can seriously flaw final results of species distribution models, by distorting the relationship between species occurrences and the underlying environmental patterns. Quantifying error sources is crucial for effective modelling, especially in conservation efforts. Under this scenario, using simulated or in-silico datasets — the so-called ‘virtual ecologist’ approach — allows to generate distribution data with known ecological characteristics, helping to simulate and thus account for spatio-temporal and taxonomic noises, thanks to the complete control on the configuration of factors constraining the distribution of species.
Needed skills
Knowledge of R; Programming/modelling; Ecological/conservation skills.
Summary
Species occurrence cubes will be available in a limited range of formats, which may hinder the reuse of these cubes in the future. This project aims to increase the range of formats that species occurrence cubes will be available and make them as ready-for-use as possible.
Needed skills
Knowledge of existing conversion tools and/or scripting using libraries using common programming languages such as Python, C, Node, Java etc. The choice of tools or scripting language is open for participants to choose – the more the merrier!
Summary: Traditional biodiversity monitoring programmes usually hold well-curated data of species occurrences, with high spatial and temporal resolutions but often only for restricted areas. In recent years, technological developments along with online digital platforms have increased the amount of information available on species occurrences, both from the scientific community as well as from community-based contributions such as citizen science. The increase in data availability and accessibility poses the challenge of integrating such highly heterogenous data sets on species occurrences, but in the context of the implementation of biodiversity policy objectives at wide scales, making the most of all information available is crucial. While integration of species occurrences from different data sources cannot be discarded, we can expect that variability in sampling methods, spatial and temporal resolution and coverage, as well as data confidence levels, will introduce bias in the assessments.
Needed skills: Programming, EBVs particularly on species distributions, biodiversity monitoring experience, indicators development and use.
Summary: Biodiversity indicators provide essential information for policy-makers, but often are only accessible to those with an understanding of coding and the time to learn a new package, namely research scientists. That means many users must rely on ready-made or commissioned analyses. Within the EU-project B-Cubed, we are providing flexible workflows to align with policy objectives. But to be truly user-friendly, these should be provided in graphical, interactive format that does not require the user to understanding coding. The goal of this hackathon project is to develop a front end for a general biodiversity indicators package e.g. in the form of a Shiny app.
Needed skills: Shiny and R, ideally with a bit of S3 knowledge.
B-Cubed’s hackathon gathered people to brainstorm and implement ideas based on their projects during an intense 4 days of productive hacking sessions. The accepted projects were aligned with B-Cubed’s objectives. The focus was on FAIR data.
Test the interoperability of data cubes with other biodiversity and environmental data.
Engage the biodiversity and ecological informatics community in innovation with cube data
LEARN - CONNECT - EXPERIMENT
Learn about data cubes through other participants, technical coaches and various presentations from keynote speakers
Connect with experts from diverse backgrounds
Experiment with data cubes
Create user-friendly and interactive data visualisation tools that enable researchers and policymakers to explore and understand biodiversity data cubes effectively.
Explore methods to integrate diverse datasets, such as satellite imagery and climate data with biodiversity data cubes that can provide a comprehensive view of ecosystems and species distributions.
Develop algorithms and techniques to assess biodiversity trends from data cubes, identify critical areas for conservation, and monitor changes in species distributions over time and space.
Build models that predict species habitat suitability, biodiversity hotspots, and potential threats to specific ecosystems.
Disseminate standards and methods for ensuring data cube interoperability, making it easier to share and combine biodiversity data cubes with other environmental data from a variety of different sources.
Develop techniques to identify and handle data inconsistencies and errors in biodiversity data cubes, ensuring the accuracy and reliability of the analyses.
Examine ways upon which data services can be built to access and interact with biodiversity data cubes.
Design educational tools and platforms using biodiversity data cubes to raise awareness about the importance of biodiversity conservation and engage the public in conservation efforts.
Identify and explore potential use cases for biodiversity data cubes across various sectors, such as agriculture, health, urban planning, and conservation policy-making.
Organise tutorials that provide participants with the necessary skills and knowledge to work with data cubes once they return home. These will cover topics such as building cubes, data analysis and visualisation.
Registration for the Hackathon amounted to a total of 60 euros for general admission and 12 euros for students. Lunch and coffee breaks were provided for all participants throughout the 4 days. However, attendees had to pay for their own travel and accommodation expenses.
General Admission:
60 euros
Student Rate:
12 euros
Virtual registration:
12 euros
Note: Buying one ticket gives you access for the full duration of the hackathon.
On the first day (2 April 2024), there was an option for BattleKarting at 17:00, featuring indoor electric go-karting and augmented reality. Those who wished to participate selected the “Battlekart” option during ticket registration, which added €20 to their ticket price. More information on this website.
On the last day (5 April 2024), drinks took place at Tour & Taxis - Food-Market, where the winners were announced. This was included in the ticket price, selecting the box when registering.
Herman Teirlinck
Havenlaan 88
1000 Brussels
We provide 11 breakout rooms accommodating up to 12 or 25 persons each. All rooms are equipped with audio and video conferencing equipment to facilitate smooth hybrid work and meetings with colleagues who cannot attend in person. Additionally, a spacious reception area and auditorium are at your disposal.
The Herman Teirlinck building is located 1 km from Brussels-Nord station and is easily accessible on foot, by bike, public transport or the free shuttle bus. It’s best to take the northern entrance from Brussel-Nord station, opening onto Solvayplein.
On foot: A 16-minute walk to Herman Teirlinck building.
Public transport: The MIVB (Brussels public transport) stop ‘Tour et Taxis’ is near the Herman Teirlinck building. Lines 14, 20, 57 and 88 serve these stops. Tram line 51 stops at ‘Vanderstichelen’ on Jubelfeestlaan and at ‘Sainctelette’ at the end of Havenlaan.
Shuttle bus: A bus departs every 5 minutes from Nord station to the Herman Teirlinck building (Tour & Taxis). The service operates from 5:30 AM to 10 PM. Follow the bus with the App.
Explore the following accommodation suggestions near the Herman Teirlinck building in Brussels.
Karel Rogierplein 20, Sint-Joost-ten-Node, 1210
Close to bus stop ‘Brussel Rogier’
Kruisvaartenstraat 6 - 10, Sint-Joost-ten-Node, 1210
Close to bus stop ‘Brussel Rogier’
Hilton Ginestestraat 3, 1210
Vooruitgangstraat 9, 1210 Sint-Joost-ten-Node
Adolphe Maxlaan 118/126, 1000
Victoria Reginaplantsoen 1, 1210
Adolphe Maxlaan 7, 1000
Koningsstraat 250, 1210 Brussel
Koningsstraat 120, 1000 Brussel
Léopoldstraat 9, 1000 Brussel
Brandhoutkaai 51, 1000 Brussel
Submit results of the projects with BioHackrXiv
In the spirit of collaborative innovation, participants of “Hacking biodiversity data cubes for policy” are encouraged to submit the results of their projects to BioHackrXiv following the Hackathon. BioHackrXiv serves as a centralised hub where the outcomes, findings, and solutions developed during the Hackathon can be shared with a broader audience. This platform not only provides participants with an opportunity to extend the impact of their work but also contributes to the collective knowledge within the community. Details on how to submit your project results to BioHackrXiv can be found here.
Data protection
Hacking biodiversity data cubes for policy is committed to safeguarding the privacy and protection of participants' personal data. Any information provided during the registration process or throughout the event will be handled with confidentiality. By registering for this event, you acknowledge that the information you provide, including personal details, may be processed. Rest assured that we adhere to data protection regulations, and your information will only be used for communication related to the hackathon. We will not share or sell participant data to third parties.
Intellectual Property disclaimer
All intellectual property rights associated with the solutions, projects, or content developed during the hackathon are owned by the respective participants or their collaborating teams. However, by participating, you grant the B-Cubed project a non-exclusive license to showcase, promote, and disseminate the submitted work for educational and promotional purposes. Participants confirm they will not infringe the copyright of any third party.
B-Cubed Hackathon projects should be open source. Code added to the Hackathon projects repository will be open source under the MIT license. It is always possible for participants to create their own repository with a license of their preference, but they should link to it from the Hackathon repository.
Code of conduct
We are committed to fostering an inclusive and respectful community where all participants can collaborate, learn, and share their experiences. This Code of Conduct outlines our expectations for behaviour within our community and provides a framework for maintaining a positive and safe environment for everyone involved. By participating in the Hackathon or engaging with our community, you agree to adhere to the principles outlined here.
If you would like to file a complaint, witness something inappropriate, or have other concerns, please contact us at b-cubedsupport@meisebotanicgarden.be.
University engagement
This Hackathon is given in kind support by Belgian Universities, including KU Leuven, Ghent University, University of Antwerp and the Université catholique de Louvain.
Funding
The Hackathon is made possible with the funding of the European Union and the support of the Research Foundation - Flanders (FWO)
Laura Abraham, Meise Botanic Garden