Introduction
I was intrigued by the idea of a pure HTML5 remote desktop client that gives cross-browser, cross-platform access to a PC. Although some AJAX VNC/RFB implementations already exist, I thought of a simpler way to build a remote desktop solution using web standards and simple image processing. This solution is built using AJAX, HTML5, JSON and simple JPEG/PNG images. The required server-side code is written in Delphi 2010.
The code - Part I
In this first part we'll look at the screen-capture code. The standard approach is to capture the whole desktop. In this case, however, we'll capture every window individually, applying clipping regions and saving each window's bitmap for later comparison and difference extraction. First, we need to enumerate all visible top-level windows:
    TWin = class(TObject)
    private
      Wnd  : HWND;
      Rect : TRect;
      Pid  : Cardinal;
    public
      constructor Create(AWnd: HWND; ARect: TRect; APid: Cardinal);
    end;

    function EnumWindowsProc(Wnd: HWND; const obj: TList<TWin>): BOOL; export; stdcall;
    var
      ProcessId : Cardinal;
      R, R1     : TRect;
      Win       : TWin;
    begin
      Result := True;
      GetWindowThreadProcessId(Wnd, ProcessId);
      if IsWindowVisible(Wnd) and not IsIconic(Wnd) then
      begin
        GetWindowRect(Wnd, R);
        IntersectRect(R1, R, Screen.DesktopRect);
        if not IsRectEmpty(R1) then
        begin
          Win := TWin.Create(Wnd, R, ProcessId);
          obj.Add(Win);
        end;
      end;
    end;

    procedure GetProcessWindowList(WinList: TList<TWin>);
    begin
      WinList.Clear;
      EnumWindows(@EnumWindowsProc, Longint(WinList));
    end;
We want to keep a list of windows, with their basic attributes and their
bitmaps, so we can compare them with the new captures and send only the
differences to the client. Here we merge the window list into a list of
TWindowMirror objects:
    TWindowMirror = class
    private
      FIndex          : Integer;
      FRgn            : HRGN;
      FHandle         : THandle;
      FBoundsRect     : TRect;
      FProcessId      : Integer;
      FImage          : TBitmap;
      FDiffStreamList : TList<TImagePart>;
      ...
      ...
    end;

    procedure TMirrorManager.RefreshMirrorList(out OneMoved: Boolean);

      procedure GetProcessWindowList(WinList: TList<TWin>);
      begin
        WinList.Clear;
        EnumWindows(@EnumWindowsProc, Longint(WinList));
      end;

    var
      wl : TList<TWin>;
      n  : Integer;
      wm : TWindowMirror;
    begin
      OneMoved := False;
      wl := TList<TWin>.Create;
      try
        // Enumerates top windows
        GetProcessWindowList(wl);
        try
          for n := wl.Count - 1 downto 0 do
          begin
            // Looks for a cached window
            wm := GetWindowMirror(FMirrorList, wl[n].Wnd);
            if Assigned(wm) then
            begin
              if IsIconic(wl[n].Wnd) then
                wm.SetBoundsRect(Rect(0, 0, 0, 0))
              else
                wm.SetBoundsRect(wl[n].Rect);
              // Returns true when at least one window moved
              OneMoved := OneMoved or
                (DateTimeToTimeStamp(Now - wm.FMoved).Time < MOVE_TIME);
            end
            else
            begin
              // Do not create a TWindowMirror for invisible windows
              if IsIconic(wl[n].Wnd) then
                Continue;
              wm := TWindowMirror.Create(Self, wl[n].Wnd, wl[n].Rect, wl[n].Pid);
              FMirrorList.Add(wm);
            end;
            // Saves the zIndex
            wm.FIndex := wl.Count - n;
            // Generates clipping regions
            wm.GenRegions(wl, n);
          end;
        finally
          ClearList(wl);
        end;
        // Sorts the mirror list by zIndex
        FMirrorList.Sort;
      finally
        wl.Free;
      end;
    end;
Next, each window is captured into a bitmap, with the clipping region restricting the copy to the window's visible area:

    function TWindowMirror.Capture(ANewImage: TBitmap): Boolean;

      function BitBlt(DestDC: HDC; X, Y, Width, Height: Integer;
        SrcDC: HDC; XSrc, YSrc: Integer; Rop: DWORD): BOOL;
      begin
        // Capture only visible regions
        SelectClipRgn(DestDC, FRgn);
        Result := Windows.BitBlt(DestDC, X, Y, Width, Height,
          SrcDC, XSrc, YSrc, Rop);
        SelectClipRgn(DestDC, 0);
      end;

    var
      DC                : HDC;
      RasterOp, ExStyle : DWORD;
    begin
      RasterOp := SRCCOPY;
      ExStyle := GetWindowLong(FHandle, GWL_EXSTYLE);
      // Layered windows need CAPTUREBLT to be included in the copy
      if (ExStyle and WS_EX_LAYERED) = WS_EX_LAYERED then
        RasterOp := SRCCOPY or CAPTUREBLT;
      DC := GetDCEx(FHandle, 0, DCX_WINDOW or DCX_NORESETATTRS or DCX_CACHE);
      try
        Result := BitBlt(ANewImage.Canvas.Handle, 0, 0,
          Width(FBoundsRect), Height(FBoundsRect), DC, 0, 0, RasterOp)
      finally
        ReleaseDC(FHandle, DC);
      end;
    end;
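The GenRegions method that builds FRgn isn't listed in this post. As a rough idea of what such a step can look like, here is a minimal sketch. It assumes the visible part of a window is its rectangle minus the rectangles of every window above it in z-order. The unit name, VisibleRegion and its parameters are hypothetical names made up for this illustration and are not taken from the project's source; only the Win32 region calls are real API.

```pascal
unit VisibleRegionSketch;

interface

uses
  Windows;

// Hypothetical helper (not the project's GenRegions).
// WinRect : bounds of the window, in screen coordinates.
// Above   : bounds of the windows that sit above it in z-order.
// Returns the uncovered part of WinRect, relative to the window's
// top-left corner. The caller must free the result with DeleteObject.
function VisibleRegion(const WinRect: TRect;
  const Above: array of TRect): HRGN;

implementation

function VisibleRegion(const WinRect: TRect;
  const Above: array of TRect): HRGN;
var
  Cover : HRGN;
  i     : Integer;
begin
  // Start with the whole window as visible
  Result := CreateRectRgnIndirect(WinRect);
  for i := 0 to High(Above) do
  begin
    Cover := CreateRectRgnIndirect(Above[i]);
    try
      // Remove the area hidden by this overlapping window
      CombineRgn(Result, Result, Cover, RGN_DIFF);
    finally
      DeleteObject(Cover);
    end;
  end;
  // Translate from screen coordinates to window-relative coordinates
  OffsetRgn(Result, -WinRect.Left, -WinRect.Top);
end;

end.
```

The region is expressed relative to the window's top-left corner, which matches how Capture draws into a bitmap starting at (0,0). A window that is completely covered simply ends up with an empty region, so nothing is copied for it.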
Now that we have captured all visible regions we need to get the bitmap
differences against the previous capture. We do this by looping through the
windows, then their visible regions and finally calculating the regions where we
find bitmap differences:
    function TWindowMirror.CaptureDifferences(reset: Boolean = False): Boolean;
    ....
    begin
      ...
      Result := Capture(TmpImage);
      if Result then
      begin
        ...
        ra := ExtractClippingRegions(Rect(0, 0, TmpImage.Width, TmpImage.Height));
        for n := 0 to Length(ra) - 1 do
        begin
          ra2 := GetDiffRects(FImage, TmpImage, ra[n]);
          for m := 0 to Length(ra2) - 1 do
          begin
            Jpg := TJpegImage.Create;
            ...
            CopyBmpToJpg(Jpg, TmpImage, ra2[m]);
            FDiffStreamList.Add(TImagePart.Create(rbmp, 'jpeg'));
            Jpg.SaveToStream(FDiffStreamList[FDiffStreamList.Count - 1].FStream);
            ...
            Bitblt(FImage.Canvas.Handle,
              ra2[m].Left, ra2[m].Top, Width(ra2[m]), Height(ra2[m]),
              TmpImage.Canvas.Handle, rbmp.Left, ra2[m].Top, SRCCOPY);
          end;
        end;
        ...
    end;
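GetDiffRects isn't shown in this post. To illustrate the idea, the sketch below scans one area of two bitmaps and returns the bounding rectangle of the pixels that changed. It is a simplification under stated assumptions: both bitmaps are 32-bit (pf32bit), have the same size, and Area lies entirely inside them. DiffBounds, TPixelRow and the unit name are hypothetical, and the real GetDiffRects returns several rectangles rather than one.

```pascal
unit DiffRectSketch;

interface

uses
  Windows, Types, Graphics;

// Hypothetical helper (not the project's GetDiffRects).
// Returns True and the bounding rectangle of all differing pixels
// inside Area, or False when the two bitmaps match there.
// Assumes both bitmaps are pf32bit and at least as large as Area.
function DiffBounds(OldBmp, NewBmp: TBitmap; const Area: TRect;
  out Diff: TRect): Boolean;

implementation

type
  // One scanline viewed as an array of 32-bit pixels
  TPixelRow = array[0..(MaxInt div SizeOf(Cardinal)) - 1] of Cardinal;
  PPixelRow = ^TPixelRow;

function DiffBounds(OldBmp, NewBmp: TBitmap; const Area: TRect;
  out Diff: TRect): Boolean;
var
  x, y           : Integer;
  OldRow, NewRow : PPixelRow;
begin
  Result := False;
  Diff := Rect(0, 0, 0, 0);
  for y := Area.Top to Area.Bottom - 1 do
  begin
    OldRow := OldBmp.ScanLine[y];
    NewRow := NewBmp.ScanLine[y];
    for x := Area.Left to Area.Right - 1 do
      if OldRow^[x] <> NewRow^[x] then
      begin
        if not Result then
        begin
          // First changed pixel: start a 1x1 rectangle
          Diff := Rect(x, y, x + 1, y + 1);
          Result := True;
        end
        else
        begin
          // Grow the rectangle to include this pixel
          if x < Diff.Left then Diff.Left := x;
          if x >= Diff.Right then Diff.Right := x + 1;
          // Rows are visited top to bottom, so only Bottom can grow
          Diff.Bottom := y + 1;
        end;
      end;
  end;
end;

end.
```

A single bounding box is the crudest option: two small changes in opposite corners produce one large rectangle. Splitting the area into tiles and running the same scan on each tile is one way to get a tighter set of rectangles to encode and send.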
In the next post we'll focus on the protocol used to talk to the client and on the client code.
HTML5 remote desktop open source
You can download the full source code from Source Forge.
UPDATE: There's a commercial version with PRO features available here.