Kuvitus: Topias Dean, Sitra

Published June 16, 2020

Ihan.fi - frequently asked questions

What is ihan.fi? The first version of the testbed and toolbox for developing services for the fair data economy was launched in June 2020. Here you will find the frequently asked questions concerning the site.

These frequently asked questions on the fair data economy and Ihan.fi website have been collected during the project, among others during the opening event of the site on 11 June 2020.

If you have a question you cannot find the answer to, please contact us at ihan@sitra.fi. We are happy to help!

General

1. What is the data economy? And what does the fair data economy mean?

The data economy is a universe of initiatives, activities and/or projects whose business model is based on the exploration and exploitation of the structures of databases to identify opportunities for generating products and services. The fair data economy is a part of the economy that focuses on creating services and data-based products in an ethical manner. Fairness means that the rights of individuals are protected and the needs of all stakeholders are taken into account.

2. What is ihan.fi? 

The ihan.fi site demonstrates how services can be built with the IHAN fair data economy infrastructure. The testbed, tools and demo applications for creating fair data economy services will be compiled under the ihan.fi platform, which will support service development on a one-stop-shop basis.

The site is being built in phases throughout 2020. The alpha version was released in June. During the summer, selected companies will start experimenting on the testbed. By the end of the year, the testbed will be released for the whole internet community.

The long-term goal of ihan.fi is to support new internet standards for data productising, portability and interoperability, and to boost the emergence of global data markets.

3. Why was ihan.fi built?

Sitra’s task is to build a successful Finland for tomorrow. The purpose of the fund is to promote the stable and balanced development of Finland, quantitative and qualitative economic growth and international competitiveness.

The better use of data and the emergence of data markets will enhance growth. Sitra has here a clear role as a neutral mediator, as regulation proceeds with it’s on pace, and companies are not necessarily capable of investing in this type of testbed. As a public body, all of our outputs are by default open and available to everyone.

4. What is the site’s target group?
The site is for all organisations and individuals interested in the data economy.

Ihan.fi and the testbed

5. What can I do on the site?
In the alpha-phase you can check out for example the rulebook template and pre-standards for data sharing, as well as a well-being app demo set up according to IHAN requirement specifications which offers a great example of a fair data economy service. By the end of 2020, developers will be able to log into the development environment, to test building those real fair data economy services with previously tested components and to provide their own data sets for others to use.

6. Can I get involved now?
You can join the community by ordering our newsletter.If you have a ready-to-go business pilot focusing on the exchange and reuse of data, you can contact us at ihan@sitra.fi for details. By default, we offer the testbed and technical support free of charge. We will not be funding any new pilots at this time.

7. How can I receive information about the opening of the site to everyone later on?
For the final and full version, we will arrange a launch event later this year. Check out the current release of ihan.fi and stay tuned for the next developments. More info on the project page.

8. Is the testbed solution purely for Finland? If not, how are you planning to drive a unified data language for global harmonisation? How will you deal with different character sets for true internationalisation? 

As the standards, architectures and concepts are built for the future Web, the testbed is also built for the global community. We welcome any nationality, company, individual and organisation to use the testbed as long as you adhere to the testbed rulebook defined by the community. Global standardisation relies on separating the linguistic semantics from the real-world context. The data product is a meta wrapper that describes the context of an IP payload in terms that both humans and machines can understand. Part of those standard attributes are the language the product is made available in. This allows, for example, to create a energy certificate of a building in any language.

Data product standards are meant to be linked directly with the real world industrial and international standards used in commerce, trade, finance, banking, manufacturing and the built environment. By defining the digital identities, e.g. through the harmonised system, we can enable adoption in any industry and company to model data products in any manufactured or traded commodity globally. The same goes too for the financial assets modelled by the ISIN-coded identities.

The scaling work has been kickstarted through open collaboration, agile experimentation, strong community and with the backing of Sitra’s role as a public utility with strong European connections. We are building on the proven and scalable technologies and concepts.

9. How is the privacy of the test user companies protected?
In this phase, the testbed is a closed environment. Security is one of the key aspects in the architecture design and audits are being planned as part of the development phases. By the end of 2020, when the testbed is opened to public, a common rulebook will define the rules for operating in the ecosystem.

10. How do we ensure that standardisation of data productising proceeds on EU level?
An open and motivated community and real cases built by the leading industry players and governments are the keys to success.

11. What is the approach to enabling semantic interoperability in the testbed?
The approach is linked digital identities and data products with global standards. The semantic interoperability is created by offering global core standards for data products and also digital identities. On the testbed the OpenAPI specification and JSON-LD based vocabulary and ontology is available in Github as a core ontology you can use to build your own industry and company specific ontologies. The testbed also offers the product gateway and the global identity graph for semantic discovery and full interoperability between software and data sources. The data source is linked only once to the standards through the data product onboarding process. After that the data becomes discoverable without any additional harmonisation effort needed from the consumers of data. One identity, all the data – any  application.

12. Are all the datasets in the system based on JSON-LD effort?
At this point of the standardisation the data products are based on OpenAPI specification 3.0 and the JSON-LD which together enable structured data exchange with contextual linking. These are so far seen as the most prominent candidates for standardisation as the OpenAPI specification has a strong footing in the market as does JSON-LD in W3C. Given that after a few years of practical use we have found out that it also needs further development.

13. My company is building an ecosystem for different roles and partners in which we will give data control completely back to the users. Can we use the data standards and protocols in the IHAN testbed as a service to integrate into our ecosystem? As I understand, we could then work as a data source via IHAN for third parties. Is that correct?

The whole point of the standards is that you can use them any way you wish to create your own ecosystem. If you want to use the testbed, the only restriction is that you accept the terms and conditions based on the upcoming testbed rulebook. You can then offer data to anybody in the IHAN ecosystem or build solutions using any data offered as you wish.

 

 

SISU ID

14. For interoperable authentication, do you use OpenID Connect or something else that is already a standard?
SisuID supports both OpenID Connect (OIDC) and SAML 2.0 integration standards, but OIDC is the preferred integration method for relying services.

 15. Which database technology and architecture are you using?
SisuID is using relational database management system (RDBMS) technology. Any SQL database like MySQL, Oracle or MS SQL server can be used. The current version of the SisuID is using MySQL database running on AWS.

 16. Is SisuID using self-sovereign identity technology?
Not yet, but we have already carried out a proof-of-concept pilot implementation in the previous phase of the project where we implemented decentralized identity capabilities in the SisuID ecosystem on top of the FIndy identity ledger.

Well-being application demo

17. In the well-being app, who is running the AI that Aino is using? Is this provided by the data operator as a service? Who is “listening” to the conversation with AI Bot?

The AI is just software in the Web i.e. selected and ran by the application provider and is open for any suitable AI/NLP solution to be used as long as the application uses the data product standards enabling the easy use of any compatible online data.

As the application developer, you decide who “listens in” to the data traffic by selecting the most suitable AI solutions, cloud services and data sources for your applications – it’s just the internet. With the help of trust architecture you can of course have better authentication and consenting mechanisms. We also plan to implement the data product exchange protocol with a fully secure data exchange layer preventing any “man-in-the-middle” attacks or any party in the network being privy to the data product contents that is being transferred.

In the current architecture there is no “data operator” role, only data providers and application developers. In reference to “MyData operators” they are just that – providers of the GDPR-compliant consent for data use and the decentralised trust oracles the productisers use to validate the existence of consent before the data products are released.

18. In the well-being app – who is paying for the data and when?

The point of the demo wasn’t first most demonstrate a business model rather the real value created for an individual with real data that “never meets”. For this demo the most probable customer would be an app user where the price of the data products are included e.g. in a monthly app subscription that yields profit for the app developer. It is up to you to decide who should pay and when – the options are not limited anymore to primarily selling the users’ data for marketing rather creating real world services with it.

19. In the well-being app, what is the governance behind the data exchange model?
The governance model is always selected by the data sharing communities and ecosystems themselves. The rulebook offers a great starting point for deciding on your own governance model. The data product exchange standards should allow for automated policy control between different ecosystems and rulebooks to also enable global interoperability between the governance models.

What's this about?