Simple sql table export in ABAP for HANA

June 16, 2013, 4:00 pm

I just read the document "Simple csv table export in ABAP for HANA" and decided to share my own experience of exporting DB tables to HANA via ABAP.

The idea was to play with HANA and try out its functionality for educational purposes.

For data extraction I wrote a simple ABAP report, which extracts the selected tables with their data and prepares SQL scripts for import.

Here is the source code (see the attached file).

Here is a screenshot of the selection screen.

You can define a schema name and choose whether a new schema should be created or an existing one used.

Of course, you also define the names of the tables to extract (structure only or with data).

Additionally, you can choose whether the files should be sent by mail or downloaded directly to a selected folder on your local hard drive.

During extraction the data is split into separate files of 65535 entries each. The SQL scripts are zipped before sending/downloading.
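The splitting step can be sketched outside ABAP as well. The following Python sketch is only an illustration of the idea, not the logic of the attached report: the function names are hypothetical, and the file naming simply mimics the hana_script_<n>_<table>.sql pattern visible in the import scripts further down.

```python
from itertools import islice
from typing import Iterable, Iterator, List

CHUNK_SIZE = 65535  # rows per generated SQL file, as stated in the text


def split_rows(rows: Iterable[str], chunk_size: int = CHUNK_SIZE) -> Iterator[List[str]]:
    """Yield consecutive lists holding at most chunk_size rows each."""
    iterator = iter(rows)
    while True:
        chunk = list(islice(iterator, chunk_size))
        if not chunk:
            return
        yield chunk


def script_name(index: int, table: str) -> str:
    """Hypothetical helper: build a file name in the pattern used by the scripts."""
    return f"hana_script_{index}_{table}.sql"
```

A table with 150000 rows would end up in three files this way: two with 65535 rows and one with the remaining 18930.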

On the next screenshot you can see the result of the extraction. I selected some tables which represent purchasing documents in the SRM solution.

After extraction...

For the mass import of the SQL scripts I created a simple cmd script on Windows.

Here is an example for the BBP_PDBEI and BBP_PDIGP tables only.

C:
cd "C:\Program Files\sap\hdbclient\"
hdbsql.exe -i 11 -n hanadb -u SYSTEM -p ********** -I V:\temp\hana_script_1_BBP_PDBEI.sql
hdbsql.exe -i 11 -n hanadb -u SYSTEM -p ********** -I V:\temp\hana_script_2_BBP_PDBEI.sql
hdbsql.exe -i 11 -n hanadb -u SYSTEM -p ********** -I V:\temp\hana_script_3_BBP_PDBEI.sql
hdbsql.exe -i 11 -n hanadb -u SYSTEM -p ********** -I V:\temp\hana_script_4_BBP_PDBEI.sql
hdbsql.exe -i 11 -n hanadb -u SYSTEM -p ********** -I V:\temp\hana_script_5_BBP_PDBEI.sql
hdbsql.exe -i 11 -n hanadb -u SYSTEM -p ********** -I V:\temp\hana_script_6_BBP_PDIGP.sql
hdbsql.exe -i 11 -n hanadb -u SYSTEM -p ********** -I V:\temp\hana_script_7_BBP_PDIGP.sql
hdbsql.exe -i 11 -n hanadb -u SYSTEM -p ********** -I V:\temp\hana_script_8_BBP_PDIGP.sql
hdbsql.exe -i 11 -n hanadb -u SYSTEM -p ********** -I V:\temp\hana_script_9_BBP_PDIGP.sql
hdbsql.exe -i 11 -n hanadb -u SYSTEM -p ********** -I V:\temp\hana_script_10_BBP_PDIGP.sql

Here I provided the path to the installed HANA DB client and to the files, which were saved in the V:\temp folder.

During the import I noticed that only one processor core is used with sequential processing.

So I changed the script for parallel processing:

C:
cd "C:\Program Files\sap\hdbclient\"
START "" hdbsql.exe -i 11 -n hanadb -u SYSTEM -p ********** -I V:\temp\hana_script_1_BBP_PDBEI.sql
TIMEOUT /T 2
START "" hdbsql.exe -i 11 -n hanadb -u SYSTEM -p ********** -I V:\temp\hana_script_2_BBP_PDBEI.sql
START "" hdbsql.exe -i 11 -n hanadb -u SYSTEM -p ********** -I V:\temp\hana_script_3_BBP_PDBEI.sql
START "" hdbsql.exe -i 11 -n hanadb -u SYSTEM -p ********** -I V:\temp\hana_script_4_BBP_PDBEI.sql
START "" hdbsql.exe -i 11 -n hanadb -u SYSTEM -p ********** -I V:\temp\hana_script_5_BBP_PDBEI.sql
START "" hdbsql.exe -i 11 -n hanadb -u SYSTEM -p ********** -I V:\temp\hana_script_6_BBP_PDIGP.sql
TIMEOUT /T 2
START "" hdbsql.exe -i 11 -n hanadb -u SYSTEM -p ********** -I V:\temp\hana_script_7_BBP_PDIGP.sql
START "" hdbsql.exe -i 11 -n hanadb -u SYSTEM -p ********** -I V:\temp\hana_script_8_BBP_PDIGP.sql
START "" hdbsql.exe -i 11 -n hanadb -u SYSTEM -p ********** -I V:\temp\hana_script_9_BBP_PDIGP.sql
START "" hdbsql.exe -i 11 -n hanadb -u SYSTEM -p ********** -I V:\temp\hana_script_10_BBP_PDIGP.sql

I added a timeout of a few seconds after the first script of each table, which creates the table. The tables must be created first; only then can the parallel import of the content start.
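The same ordering rule (create each table first, then load the rest in parallel) can be expressed without fixed timeouts. This is a rough Python sketch under the assumption that the script with the lowest number for each table is the one that creates it. The hdbsql options are copied from the cmd script above; plan_import, build_command and run_import are hypothetical names.

```python
import re
import subprocess
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor
from typing import Dict, List, Tuple

# Matches names such as hana_script_10_BBP_PDIGP.sql (pattern taken from the scripts above).
_NAME = re.compile(r"hana_script_(\d+)_(.+)\.sql$")


def plan_import(files: List[str]) -> Tuple[List[str], List[str]]:
    """Return (first script of each table, all remaining scripts)."""
    by_table: Dict[str, List[Tuple[int, str]]] = defaultdict(list)
    for name in files:
        match = _NAME.search(name)
        if match is None:
            raise ValueError(f"unexpected file name: {name}")
        by_table[match.group(2)].append((int(match.group(1)), name))
    first: List[str] = []
    rest: List[str] = []
    for scripts in by_table.values():
        ordered = sorted(scripts)  # numeric order, so 10 sorts after 9
        first.append(ordered[0][1])
        rest.extend(name for _, name in ordered[1:])
    return first, rest


def build_command(script: str, password: str) -> List[str]:
    """Build the hdbsql call with the same options as the cmd script."""
    return ["hdbsql.exe", "-i", "11", "-n", "hanadb",
            "-u", "SYSTEM", "-p", password, "-I", script]


def run_import(files: List[str], password: str, workers: int = 4) -> None:
    """Run the table-creating scripts one by one, then the rest in parallel."""
    first, rest = plan_import(files)
    for script in first:
        subprocess.run(build_command(script, password), check=True)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # list() forces evaluation so that a failed import raises here.
        list(pool.map(lambda s: subprocess.run(build_command(s, password), check=True), rest))
```

Waiting for the first script of a table to finish replaces the TIMEOUT guess; the number of workers is an arbitrary choice and would need tuning.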

Now it looks much better. All CPU cores are used and the import runs more efficiently...

And here is the result.

Looks great...

Problems I encountered:

The import is relatively slow. I expected the SQL import to run much faster.
The memory consumption of the ABAP report is very high. Maybe you have some advice on how to optimize it.
HANA does not understand field names with a "/" sign. SRM uses some field names (and even table names) of the form /SAPSRM/*.
ABAP writes negative values as '1-', HANA needs '-1'.
How can cluster tables be exported? Actually I did not need that, but it would be interesting to know...
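Two of these problems are plain text transformations and can be sketched in a few lines. The Python below is an illustration only, with hypothetical function names: the first function moves a trailing minus sign to the front, the second wraps a name in double quotes. It assumes that HANA, like standard SQL, accepts special characters such as "/" inside double-quoted identifiers.

```python
def abap_number_to_sql(value: str) -> str:
    """Turn an ABAP-style trailing minus ('1-') into a leading one ('-1')."""
    text = value.strip()
    if text.endswith("-"):
        return "-" + text[:-1].strip()
    return text


def quote_identifier(name: str) -> str:
    """Wrap a table or field name in double quotes, doubling embedded quotes."""
    return '"' + name.replace('"', '""') + '"'
```

The same two rules could be applied inside the ABAP report while it writes the INSERT statements.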

P.S.: English is not my native language, and nobody is immune to mistakes and typos. If you find an error in the text, please let me know.

P.P.S.: If you have ideas on how to correct or improve the report, please don't hesitate to leave a comment.

↧

SAP HANA Installation in Oracle VirtualBox VM

June 16, 2013, 3:51 am

Latest and popular articles on SAP ERP

A very interesting article was brought to my attention about two weeks ago regarding the installation of SAP HANA Platform Edition 1.0 SP05 on a VMware virtual machine (most likely VMware Player). Being a SAP HANA enthusiast, I decided to undertake the same process using Oracle VirtualBox v4.2.12. I had been a big fan of VMware Player for a long time, but about 2 years ago I switched to VirtualBox (for reasons that I won't get into right now).

So, after finally getting my purchase order approved by my wife, I upgraded my PC to 32GB of RAM and installed SAP HANA PE1.0 SP05 into VirtualBox running SAP SUSE Linux Enterprise Server 11.2.

The installation is relatively straightforward, with a couple of minor VirtualBox issues. The full instructions can be found here, thanks to W. Goslinga:

http://scn.sap.com/community/developer-center/hana/blog/2013/05/08/how-to-install-the-hana-server-software-on-a-virtual-machine

The key quirks with VirtualBox were:

You need to enable the CMPXCHG16B instruction after you have created the guest in VirtualBox. Without the CMPXCHG16B instruction enabled, the HANA installation will fail.
VirtualBox with SUSE 11.2 running on my Intel i7 reported the number of CPU sockets as 0. As a result, the HANA hardware check failed with a divide-by-zero error and terminated the installation regardless of the IDSPISPOPD environment variable. I manually edited the HanaHwCheck.py Python script and forced the number of sockets to 1.
One last issue: I had the HANA media sitting on a different VM and had NFS-mounted the filesystem on my HANA host. A number of packages failed to "untar" during the installation until I mounted the NFS share as a "rw,hard,intr" mount. Evidently the NFS soft mount was not playing nice over my internal network.

Technical bits:

cd <virtualbox install dir>; VBoxManage setextradata [vmname] VBoxInternal/CPUM/CMPXCHG16B 1
vi <path to HANA media>/DATA_UNITS/HDB_SERVER_LINUX_X86_64/server/HanaHwCheck.py
- comment out the line > self.HWInfo['CPU Sockets']=len(lines)-1
- insert line > self.HWInfo['CPU Sockets']=1 (or set to the actual number of sockets you have)
If using an NFS mount, ensure it is set to a "hard" mount, e.g. in /etc/fstab (vi /etc/fstab):
<nfshost>:/software /software nfs rw,hard,intr 0 0

Below is a screen shot of SAP HANA Studio directly after I finished the installation of HANA.

I plan to post a YouTube video of the installation process shortly. The session will cover the full life cycle from VirtualBox guest creation including network config, through to the completion of the HANA installation. We'll also install SAP HANA Studio on the physical PC and connect to the HANA backend. Stay tuned.

↧

Connecting to your hana database from php using odbc.

June 17, 2013, 5:53 am

I spent quite some time getting a connection to my HANA database working from my PHP page, and I found a lot of help on the forum. I just want to take the time to help out anyone who, like me, is new to SAP HANA. The first thing to know is that PHP is usually a 32-bit build, so you'll need to install a 32-bit HANA client (to get the 32-bit HANA ODBC driver) to make ODBC connections from your PHP page. Here's a how-to: http://www.youtube.com/watch?v=au7eziBLAtU . You can check whether the installation went all right by opening ODBC Data Sources (32 bit): search for "ODBC Data Sources"; it is usually located at "C:\Windows\SysWOW64\odbcad32.exe". If the driver HDBODBC32 is not listed in the Drivers tab, you'll have to add a new data source from the System DSN tab. Once you have HDBODBC32 (the 32-bit driver), you are all set. Also note that PHP's ODBC extension is enabled by default, so you probably won't have to modify your php.ini. I'm using XAMPP and did not have to change anything.

Here's some working sample code.

<?php

// 32-bit ODBC driver that comes with the HANA client installation.
$driver     = "HDBODBC32";
// Enter your external access server name and port.
$servername = "yourservername.vm.cld.sr:30015";
// This is the default name of your HANA instance.
$db_name    = "HDB";
// This is the default username; provide your own username.
$username   = "SYSTEM";
// This is the default password; provide your own password.
$password   = "manager";

$conn = odbc_connect("Driver=$driver;ServerNode=$servername;Database=$db_name;", $username, $password, SQL_CUR_USE_ODBC);

// Example query string. $siteID, $time, $sensorValue, $kva, $pf and $errorLog
// must be set beforehand; the values are concatenated as-is, so use trusted input only.
$queryString = 'INSERT INTO "SCHEMA_NAME"."table_name" (SiteID,Date_Time,SensorValue,KVA,PF,ErrorLog) VALUES('.$siteID.',\''.$time.'\','.$sensorValue.','.$kva.','.$pf.',\''.$errorLog.'\' )';

// For debugging, echo $queryString, then copy and paste it into SAP HANA Studio to see the result.

// The if condition is optional; odbc_connect returns false when the connection fails.
if ($conn)
{
    // odbc_exec prepares and executes the SQL statement.
    odbc_exec($conn, $queryString);
}
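The query string above is assembled by concatenating values, which breaks as soon as a value contains a quote. A common alternative is a statement with "?" placeholders, with the values passed separately (in PHP, odbc_prepare and odbc_execute play that role). The following Python sketch only builds the strings and is an illustration, not part of the original sample; both function names are hypothetical, and the connection string keys are the ones used in the PHP code.

```python
from typing import Sequence


def build_connection_string(driver: str, server_node: str, database: str) -> str:
    """Assemble the ODBC connection string with the keys used in the PHP sample."""
    return f"Driver={driver};ServerNode={server_node};Database={database};"


def build_insert(schema: str, table: str, columns: Sequence[str]) -> str:
    """Build an INSERT statement with one '?' placeholder per column.

    Schema and table are double-quoted as in the sample; column names are
    left unquoted, also as in the sample.
    """
    column_list = ", ".join(columns)
    placeholders = ", ".join("?" for _ in columns)
    return f'INSERT INTO "{schema}"."{table}" ({column_list}) VALUES ({placeholders})'
```

The values would then be handed to the driver as a separate list, in the same order as the columns.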

↧

A simple rule to live by when using the SAP HANA IMPORT FROM command - Don't forget the ERROR LOG clause

June 17, 2013, 8:24 pm

In life, there are simple rules to live by. For example, Jim Croce told us that "You don't mess around with Jim". In the database world there are similar rules, like "you never issue an UPDATE or DELETE statement without a WHERE clause". In this blog post, I'm hopefully going to convince you to add another rule: "Always include the ERROR LOG clause with your IMPORT FROM command".

I'm currently working on a Big Data project where I'm importing the results from a generated data file based on Wikipedia page count data. After doing the import operation, I did a SELECT COUNT(*) on the resulting table and then got to wondering - was I hallucinating or am I missing almost 2 million rows of data?

It turns out that I was missing more than 2 million rows of data - yikes! So what's going on? When I ran the IMPORT FROM command, it reported that it took 45 seconds, affected 0 rows (that alone is a bit disturbing) and that there were no errors. So, again - what's going on?

Since the data I'm getting from Wikipedia could be suspect, my first suspicion was that it had to be a problem with using the pipe "|" symbol as a delimiter. My original file actually used the 0x01 character as a field delimiter, and I had used the following sed Linux command to change it to the pipe character:

sed "s/\x01/|/g" 000000 > 000000.csv

I then used the wc (word count) command to count the number of rows in both files to compare the results.

hana:/wiki-data/year=2013/month=05 # wc --lines 000000*
4755634 000000
4755634 000000.csv
9511268 total

As you can see, the line counts were identical. So, I opened up the help topic for the command at http://help.sap.com/hana/html/sql_import_from.html and noticed that there is a clause called "ERROR LOG", so why not give it a try. I went ahead and added the following clause:

ERROR LOG '/wiki-data/import.err'

After running the IMPORT FROM command again, I got no errors, but what was weird was I also had no import.err file in my /wiki-data directory. This I've seen before, so I issued the following Linux command to make sure the SAP HANA database engine can write data to this directory:

chmod 777 /wiki-data

Lo and behold, I ended up with a 55 MB import.err file! It turns out that there were two things preventing the load of all the data. First, one of my column definitions was not large enough to hold the longest of the Wikipedia page titles, which had a length of 1023, more than double the VARCHAR(500) that I had defined. So I dropped the table and recreated it with a column length of 2000 to be on the safe side. I then came across a new error: numeric overflow. It turns out I needed to use a BIGINT data type for the number of bytes downloaded per hour for a page. After making that correction, the COUNT(*) finally matched the line count for the three CSV files that I imported.
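Both problems (a column that is too short and a number that does not fit into INTEGER) can be spotted before the import by scanning the file. The Python sketch below is an illustration under assumptions: the function names are hypothetical, lengths are counted in characters rather than bytes, and the range used is that of a signed 32-bit integer.

```python
from typing import Iterable, List

INT_MIN = -2**31       # smallest signed 32-bit integer
INT_MAX = 2**31 - 1    # largest signed 32-bit integer


def max_field_lengths(lines: Iterable[str], delimiter: str = "|") -> List[int]:
    """Return the longest value (in characters) seen in each column."""
    widths: List[int] = []
    for line in lines:
        fields = line.rstrip("\n").split(delimiter)
        for position, field in enumerate(fields):
            if position == len(widths):
                widths.append(0)  # first time this column position is seen
            widths[position] = max(widths[position], len(field))
    return widths


def needs_bigint(values: Iterable[str]) -> bool:
    """True if any value falls outside the signed 32-bit integer range."""
    return any(not (INT_MIN <= int(value) <= INT_MAX) for value in values)
```

Comparing the reported widths with the table definition shows up front which columns would reject rows.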

I was lucky and noticed that the COUNT result didn't seem right and tracked it down, but I'm guessing that most people that use the IMPORT FROM command aren't using the optional ERROR LOG clause. So - back to the new rule.

Create a directory that you will use for your error file and make sure the HANA database engine has the right to write to it, using the chmod 777 <directory name> command.
Just because the IMPORT FROM command reports no errors when running it from SAP HANA Studio, doesn't mean there were no errors. Always include the ERROR LOG clause and then check to see that it's a zero byte file. Otherwise, open it up and examine the records.
Tell your friends and colleagues about this rule.
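The check described in these rules can be automated after each import. This is a small Python sketch with a hypothetical function name; it assumes you already know the line count of the source file (for example from wc) and the COUNT(*) of the target table.

```python
import os


def import_looks_clean(error_log_path: str, file_line_count: int, table_row_count: int) -> bool:
    """Apply the rule: the error log must exist, be empty, and the counts must match."""
    if not os.path.exists(error_log_path):
        # A missing log is suspicious too: the database may not be able to write it.
        return False
    if os.path.getsize(error_log_path) > 0:
        return False
    return file_line_count == table_row_count
```

A missing log file is treated as a failure on purpose, because that is exactly what happened before the directory permissions were fixed.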

So in the spirit of Jim Croce, you can check out my updated lyrics and sing along:

"You don’t tug on Superman's cape"

"You don’t spit into the wind"

"You don’t pull the mask off that old Lone Ranger"

And you don't forget the ERROR LOG clause for the IMPORT FROM command.

Again, data is a precious thing to waste, so please pass this on.

Regards,

Bill Ramos

↧

A peek inside xSync and the HANA XS Engine

June 18, 2013, 5:58 am

On Saturday I published a blog about a small app I wrote called xSync, basically an XS Engine app for Mac developers that lets you sync a local development folder with your HANA repository. This is for rapid development and to encourage the "bring your own IDE" approach to application development on HANA. Here is a look behind the scenes at how the app works and at some of the challenges of the project.

As mentioned in my previous blog, I started using the IDE Lightweight Editor after upgrading my AWS HANA box last weekend. I enjoyed the experience, but after working with it for nearly a full day I was wanting a little more: syntax highlighting, easy commenting, easy indentation, CSS autocomplete and hints, etc. So I started doing some peeking around the editor itself and came to find that the editor is something called ACE, a pretty nice little open source code editor (written in JS). This got me thinking: maybe I could insert text directly into the Lightweight IDE browser text box and submit the form as a save. Not a terrible idea; I would just need to scrape the page, find the elements and submit the form via some injected JS. Pretty simple.

I did some digging and found the HTML objects I needed by using Firebug, when a lightbulb went off: instead of populating the form via an HTML page, why not check the HTTP methods it calls when doing the actual save, since there must be some integration with HANA directly? That is when I came across the mother lode: a small file called reposervice.xsjs. It seemed that every time I was saving or modifying my objects through the IDE, it was calling this file. After checking out the parameters it was passing, it was very clear that the methods and text were easy to simulate. I fired up REST Client and within a couple of minutes the concept was POC'ed. Pass your file contents as the body with a path param and a POST, and you are off to the races.

Using the Firefox REST Client to monitor the calls showed that each save, create and delete operation was using a small file called reposervice.xsjs, which references the libraries needed for the repository modifications.

The diagram above displays the HTTP call made when saving/creating a file, and how the IDE initially does a HEAD request for the XSRF token, followed by the HTTP PUT.

The initial HEAD request fetches the CSRF token. Then the token is passed to the URL along with the mode, path and activate parameters. If the call is successful, a JSON message is returned with the status. For those of you who are not familiar with Cross-Site Request Forgery, you can read about it here: http://en.wikipedia.org/wiki/Cross-site_request_forgery
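The two-step exchange can be sketched as plain request descriptions. The Python below sends nothing and is only an illustration of an undocumented interface: the parameter names mode, path and activate come from the text, while the X-CSRF-Token header with the value "Fetch", the example parameter values and both function names are assumptions.

```python
from typing import Dict
from urllib.parse import urlencode


def token_request(service_url: str) -> Dict[str, object]:
    """Describe the initial HEAD request that asks for a CSRF token."""
    return {
        "method": "HEAD",
        "url": service_url,
        "headers": {"X-CSRF-Token": "Fetch"},  # assumed header name and value
    }


def save_request(service_url: str, token: str, path: str, body: str,
                 mode: str, activate: str, method: str = "PUT") -> Dict[str, object]:
    """Describe the follow-up request that carries the token and the file body."""
    query = urlencode({"mode": mode, "path": path, "activate": activate})
    return {
        "method": method,
        "url": f"{service_url}?{query}",
        "headers": {"X-CSRF-Token": token},
        "body": body.encode("utf-8"),
    }
```

The HTTP method is left as a parameter because the description mentions both a POST and a PUT for the save call.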

Once I had this done, I was wondering what the best integration option would be. I weighed up a couple of options, such as a simple check-in type procedure, but wanted something faster, easier and "click free". Being a bit of a highly iterative developer myself, I find it easier to develop "online", which is why I decided it would be best to do a file system watch of a particular folder and save any changes automatically to my HANA instance, similar to a Dropbox-type approach.

I had my POC working nicely and an integration goal defined, and set out to start developing the UI/application in Objective-C (Xcode). I had a template type of app from one of my little SAP Note Viewer applications which could act as a foundation. I threw some code out and pulled some very useful little open source packages in as helpers. After a couple of hours each evening, the app was running nicely and doing what I had expected: modify a file or two in a predefined location and it syncs up to XS. Easy.

That's generally where development grinds to a halt for me, as I envision feature after feature to build a Mac clone of HANA Studio. Luckily my senses got the better of me, and I worked on a recursive package downloader and the ability to create, rename and delete files and folders, not a HANA Studio rewrite. Once this was all done, ironing out the bugs was painful. The Cocoa FSEvents (File System Events) stream API on the Mac is not easy to work with and a bear at best. Having to monitor a folder for any modifications, deletes and creates turned into a bit of a logic nightmare. One of the interesting challenges is that if you "delete" a file on the Mac file system, you do not get a "delete" FS event but rather a rename (since the file goes to the Trash). This leads to having to wrap multiple "if exists then ..." type statements around each file and folder event.
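The "if exists then ..." logic boils down to ignoring the reported event type and looking at the file system instead. The app itself is written in Objective-C; the Python sketch below only illustrates the decision, and the function name, the action names and the set of already synced paths are hypothetical.

```python
import os
from typing import Callable, Set


def resolve_event(path: str, known_paths: Set[str],
                  exists: Callable[[str], bool] = os.path.exists) -> str:
    """Decide what to sync for a changed path, whatever the event claimed.

    known_paths holds the paths that have already been synced.
    """
    if not exists(path):
        # Gone from the watched folder: a real delete or a move to the Trash.
        return "delete" if path in known_paths else "ignore"
    return "update" if path in known_paths else "create"
```

A rename then shows up as two events: the old path resolves to "delete" and the new path to "create".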

The UI is another interesting one. I like apps to look somewhat decent, and I spent a good amount of time working on each of the elements in Adobe Photoshop as usual. (Whenever I do a mobile app development talk I mention that I spend close to 40% of the entire project time in apps like Photoshop doing design work. Most are surprised!)

If you are interested in incorporating some of these types of features into your own app, I will be posting a copy of the integration classes on GitHub shortly.

PLEASE KEEP IN MIND: This is exploratory work with undocumented APIs. I would not recommend using it in production or for any important work (or your important openSAP homework!). The reason I shared this was to encourage people to look under the hood and understand the hows and whys of how some of these great new tools work.

I would be interested to hear if anyone has any interesting use-cases for being able to manipulate both HANA repository and DB artifacts from outside of the Studio? Does anyone have any challenges with the HANA Studio today they would like to see changed?

↧

Real-time sentiment rating of movies on SAP HANA One

June 18, 2013, 7:26 pm

I am an intern visiting Palo Alto from SAP’s Shanghai office for a month-long project. It’s my first trip to the Bay Area, so I am soaking up all the sun and all the excitement here. Last weekend, I found myself wanting to watch a movie. I searched the internet and found all the new releases listed on Rotten Tomatoes and IMDb, but it was hard to pick one. I wanted to get a pulse of a movie before watching it, not from the critics but from actual moviegoers like me. Also, I wanted one which had high buzz not only in the US but also in China. So I decided: why don’t I build something myself? After all, I am in the heart of Silicon Valley.

I decided to pick SAP HANA One to power my app, not just because I get the database and application server in the cloud but also because the platform supports sentiment analysis for English and Simplified Chinese right out of the box! I used the Rotten Tomatoes API to find newly released movies, and the Twitter and Sina Weibo APIs for sentiment in the US and China respectively.

Prerequisites

Before we start to build the application, we need to get SAP HANA One developer edition and install SAP HANA Studio. You can get the info here:

"Get your own SAP HANA, developer edition on Amazon Web Services" http://scn.sap.com/docs/DOC-28294

You can find how to get SAP HANA One developer edition in part 1, 2, 5 and how to install SAP HANA Studio in part 3, 4.

Schema

I did most of my work in HANA Studio, which is based on the Eclipse IDE and is therefore very familiar to Java and other open-source developers.

First, I created a schema and a full text index for all the movie metadata, including title, rating, running time, release date, synopsis, etc. Then I used JTomato (https://github.com/geeordanoh/JTomato) to populate the table.

MOVIE: Stores movie metadata, including the title, rating, runtime, release date, etc.

Then I used Twitter4J (http://twitter4j.org) to search for the movie keywords on Twitter. I found that Twitter, given just the keyword, did a good job of pulling all combinations of the movie name: fast and furious, fast & furious.

TWEET: Stores crawled tweets from Twitter, including ID, time, location, content, etc.

However, I ran into problems while crawling Sina Weibo because they have a strict process for usage of their API. So I decided to use Tencent Weibo instead.

TWEET_ZH: Stores crawled tweets from Tencent Weibo

Next I created a fulltext index and sentiment tables (called VoiceOfCustomer) using the following SQL. Voila! I now have sentiment analysis for all Twitter and Tencent Weibo data!

CREATE FULLTEXT INDEX TWEET_I ON TWEET (CONTENT) CONFIGURATION 'EXTRACTION_CORE_VOICEOFCUSTOMER' ASYNC FLUSH EVERY 1 MINUTES LANGUAGE DETECTION ('EN') TEXT ANALYSIS ON;

CREATE FULLTEXT INDEX TWEET_ZH_I ON TWEET_ZH (CONTENT) CONFIGURATION 'EXTRACTION_CORE_VOICEOFCUSTOMER' ASYNC FLUSH EVERY 1 MINUTES LANGUAGE DETECTION ('ZH') TEXT ANALYSIS ON;

TWEET_I: Used to perform sentiment analysis for the table TWEET

TWEET_ZH_I: Used to perform sentiment analysis for the table TWEET_ZH

In addition to the tables in SAP HANA and the full text index to perform sentiment analysis, I also wrote stored procedures to wrap complex SQL making it easy for XS (HANA’s application server) to consume.

Architecture

The final architecture looks like this:

Rating

Now, I had to create a formula to quantify rating. I used a very simple formula for this:

Score = (# of strong positive sentiment * 5 + # of weak positive sentiment * 4 + # of neutral sentiment * 3 + # of weak negative sentiment * 2 + # of strong negative sentiment *1) / # of total sentiments
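As a quick check of the formula, here is a small Python sketch. The sentiment type names are the ones used in the SQL of this post; the function name is hypothetical, and returning 0 when there are no sentiments mirrors the CASE expression in the stored procedure.

```python
from typing import Dict

# Weight of each sentiment type, as given in the formula.
WEIGHTS: Dict[str, int] = {
    "StrongPositiveSentiment": 5,
    "WeakPositiveSentiment": 4,
    "NeutralSentiment": 3,
    "WeakNegativeSentiment": 2,
    "StrongNegativeSentiment": 1,
}


def movie_score(counts: Dict[str, int]) -> float:
    """Weighted average of the sentiment counts on a scale from 1 to 5."""
    total = sum(counts.get(name, 0) for name in WEIGHTS)
    if total == 0:
        return 0.0  # no sentiments at all, same as the SQL CASE branch
    weighted = sum(weight * counts.get(name, 0) for name, weight in WEIGHTS.items())
    return weighted / total
```

For example, 2 strong positive, 1 neutral and 1 strong negative sentiment give (2 * 5 + 1 * 3 + 1 * 1) / 4 = 3.5.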

This score would be helpful to rank movies so I could easily pick the top one.

Additionally, I showed a distribution of the sentiments, positive vs. negative vs. neutral, so I could better understand how strong or weak people’s opinion was on the movie both in US & in China.

XS Application

The application should be built on XS Engine to prevent data transfer latency between the database and the web application server so users can access the website directly. The application was built in the following steps:

Step 1: Create stored procedures for rating and sentiment analysis

Currently, there are two stored procedures in the app. One is for rating and the other is for sentiment analysis:

1. Rating

We can use the following SQL statements to create the type and the stored procedure:

CREATE TYPE MOVIEINFO AS TABLE (
  POSTER NVARCHAR(100),
  TITLE NVARCHAR(100),
  RATING DECIMAL(5, 2),
  NUM INTEGER,
  TITLE_ZH NVARCHAR(100),
  RATING_ZH DECIMAL(5, 2),
  NUM_ZH INTEGER,
  YEAR INTEGER,
  MPAA_RATING NVARCHAR(100),
  RUNTIME NVARCHAR(100),
  CRITICS_CONSENSUS NVARCHAR(2000),
  RELEASE_DATE DATE,
  SYNOPSIS NVARCHAR(2000),
  ID INTEGER
);

CREATE PROCEDURE GETMOVIEINFO(OUT RESULT MOVIEINFO) LANGUAGE SQLSCRIPT READS SQL DATA AS
BEGIN
  RESULT =
    SELECT A.POSTER, A.TITLE, B.RATING, B.NUM, A.TITLE_ZH, C.RATING_ZH, C.NUM_ZH, A.YEAR, A.MPAA_RATING, A.RUNTIME, A.CRITICS_CONSENSUS, A.RELEASE_DATE, A.SYNOPSIS, A.ID
    FROM MOVIE A
    INNER JOIN
      (SELECT ID, CASE SUM(NUM) WHEN 0 THEN 0 ELSE TO_DECIMAL(SUM(TOTAL) / SUM(NUM), 5, 2) END AS RATING, SUM(NUM) AS NUM FROM
        (SELECT
           A.ID,
           C.TA_TYPE,
           COUNT(C.TA_TYPE) AS NUM,
           CASE C.TA_TYPE
             WHEN 'StrongPositiveSentiment' THEN COUNT(C.TA_TYPE) * 5
             WHEN 'WeakPositiveSentiment' THEN COUNT(C.TA_TYPE) * 4
             WHEN 'NeutralSentiment' THEN COUNT(C.TA_TYPE) * 3
             WHEN 'WeakNegativeSentiment' THEN COUNT(C.TA_TYPE) * 2
             WHEN 'StrongNegativeSentiment' THEN COUNT(C.TA_TYPE) * 1
           END AS TOTAL
         FROM MOVIE A
         LEFT JOIN TWEET B
           ON A.ID = B.MOVIEID
         LEFT JOIN "$TA_TWEET_I" C
           ON B.ID = C.ID AND C.TA_TYPE IN ('StrongPositiveSentiment', 'WeakPositiveSentiment', 'NeutralSentiment', 'WeakNegativeSentiment', 'StrongNegativeSentiment')
         GROUP BY
           A.ID,
           C.TA_TYPE) A
       GROUP BY ID) B ON A.ID = B.ID
    INNER JOIN
      (SELECT ID, CASE SUM(NUM) WHEN 0 THEN 0 ELSE TO_DECIMAL(SUM(TOTAL) / SUM(NUM), 5, 2) END AS RATING_ZH, SUM(NUM) AS NUM_ZH FROM
        (SELECT
           A.ID,
           C.TA_TYPE,
           COUNT(C.TA_TYPE) AS NUM,
           CASE C.TA_TYPE
             WHEN 'StrongPositiveSentiment' THEN COUNT(C.TA_TYPE) * 5
             WHEN 'WeakPositiveSentiment' THEN COUNT(C.TA_TYPE) * 4
             WHEN 'NeutralSentiment' THEN COUNT(C.TA_TYPE) * 3
             WHEN 'WeakNegativeSentiment' THEN COUNT(C.TA_TYPE) * 2
             WHEN 'StrongNegativeSentiment' THEN COUNT(C.TA_TYPE) * 1
           END AS TOTAL
         FROM MOVIE A
         LEFT JOIN TWEET_ZH B
           ON A.ID = B.MOVIEID
         LEFT JOIN "$TA_TWEET_ZH_I" C
           ON B.ID = C.ID AND C.TA_TYPE IN ('StrongPositiveSentiment', 'WeakPositiveSentiment', 'NeutralSentiment', 'WeakNegativeSentiment', 'StrongNegativeSentiment')
         GROUP BY
           A.ID,
           C.TA_TYPE) A
       GROUP BY ID) C ON A.ID = C.ID
    ORDER BY B.RATING DESC
  ;
END;

After creating the type and the stored procedure successfully, we can use the following SQL to test:

CALL GETMOVIEINFO(?);

From the columns “RATING” and “RATING_ZH”, we can show the score on the main page.

2. Sentiment analysis

We can use the following SQL statements to create the type and the stored procedure:

CREATE TYPE SENTIMENT AS TABLE (SENTIMENT NVARCHAR(100), NUM INTEGER);

CREATE PROCEDURE GETSENTIMENT(IN ID INTEGER, IN LANG VARCHAR(2), OUT RESULT SENTIMENT) LANGUAGE SQLSCRIPT READS SQL DATA AS

BEGIN

IF LANG = 'EN' THEN

RESULT = SELECT 'Strong Positive' AS SENTIMENT, COUNT(*) AS NUM FROM "$TA_TWEET_I" A